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PROKARYOTTC EXPRESSION OF MHC PROTEINS 
5 The present application is a continuation-in-part of U.S. S.N. 08/143,575 

filed October 25, 1993. The entire application is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
The invention relates to methods of producing compositions for the 
modulation of T cell function in the treatment of for example, autoimmune diseases, 
10 allergic responses, transplant rejection, and other immunological disorders. In particular, 
it concerns production of major histocompatibility complex (MHC) class I and class II 
proteins in prokaryotes that have been transformed with nucleotide sequences that code 
for the proteins. The MHC proteins are useful for making complexes that target T cells. 
The complexes comprise the MHC proteins and peptides representing fragments of 
15 antigens associated with the particular diseases. These complexes can be further 

conjugated to radioisotopes or other labels for diagnostic purposes, or to toxins or other 
substances which render the complexes therapeutically useful. 

A number of pathological responses involving unwanted T cell activation 
are known. For instance, a number of allergic diseases have been associated with 
20 particular MHC alleles or are suspected of having an autoimmune component. 

Other deleterious T cell-mediated responses include the destruction of 
foreign cells that are purposely introduced into the body as grafts or transplants from 
allogeneic hosts. This process, known as "allograft rejection, " involves the interaction of 
host T cells with foreign MHC molecules. Quite often, a broad range of MHC alleles 
25 are involved in the response of the host to an allograft. 

Autoimmune disease is a particularly important class of deleterious 
immune response. In autoimmune diseases, self-tolerance is lost and the immune system 
attacks "self tissue as if it were a foreign target. More than 30 autoimmune diseases are 
presently known; thfese include many which have received much public attention, 
30 including myasthenia gravis (MG) and multiple sclerosis (MS). 

The involvement of the MHC Class II proteins in autoimmune disease has 
been shown in animal models. Administration of antibodies to either MHC Class II 
proteins themselves or antibodies to agents that induce expression of the MHC Class n 
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genes interferes with development of the autoimmune condition in these model systems. 
The role of helper T cells has also been demonstrated in these models by counteracting 
the autoimmune system using anti~CD4 monoclonal antibodies; CD4 is the characteristic 
helper T cell receptor (Shizuru, J. A. et ah, Science (1988) 240: 659-662). 

Recent experiments have shown that, under certain circumstances, anergy 
or nonresponsiveness can be induced in autoreactive lymphocytes (see, Schwartz, Cell 
(1989) 1073-1081). In vitro experiments suggest that antigen presentation by MHC Class 
II molecules in the absence of a co-stimulatory signal induces a state of proliferative non- 
responsiveness in syngeneic T cells (Quill et ah, X Immunol. (1987) 138: 3704-3712). 
As described by Sharma et al (Proc. Natl Acad. Scu USA (1991) 88: 11465-11469) 
anergy can be induced in vivo and autoimmune disease can be effectively treated in this 
manner. 

MHC polypeptides thus have several pharmaceutical uses. However, to 
realize the potential of these types of treatments, a source of abundant MHC polypeptides 
is needed. MHC polypeptides have been expressed in mammalian cells. For example, a 
soluble form of the mouse I-E* protein has been expressed in CHO cells (Wettsttein 
et.al., 7. Exp. Med. 174: 219-228 (1991)). The expression levels from mammalian 
systems, however, are not sufficient for economical production of MHC polypeptides on 
a commercial scale. Moreover, mammalian cells load the MHC peptide binding pocket 
with endogenous peptides, necessitating peptide removal from the MHC. Thus, the prior 
art lacks methods for producing large quantities of therapeutically active MHC 
polypeptides at low cost. The present invention addresses these and other needs. 

SUMMARY OF THE INVENTION 
The present invention provides compositions comprising recombinant 
MHC polypeptides having altered glycosylation, in which the recombinant MHC 
polypeptide binds an antigenic peptide. The transmembrane domain is lacking from 
some of the recombinant constructs which are disclosed. The MHC polypeptide . 
compositions are expressed in a prokaryotic host cell such as E. coli utilizing an 
expression vector containing a nucleotide sequence encoding the MHC polypeptide. 
MHC class II recombinant polypeptides are disclosed, including the a- and 0-chains. 
Multiple recombinant MHC polypeptides are optionally associated to form active MHC 
compositions. 
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The present invention includes a method of producing an MHC 
polypeptide comprising the steps of (a) growing in a culture prokaxyotic cells containing 
an expression vector comprising a nucleotide sequence encoding the MHC polypeptide 
under such conditions that the polypeptide is expressed; and (b) extracting and isolating 
5 the MHC polypeptide. The method provides for the expression of two MHC 

polypeptides in a single prokaryotic cell, wherein the polypeptides form a heterodimer. 
The compositions produced by the method are also the subject of the invention. 

The invention further provides for a prokaryotic expression vector 
comprising a nucleotide sequence coding for an MHC polypeptide operably linked to a 

10 prokaryotic promoter sequence. A signal sequence operably liked to the MHC 

polypeptide sequence is optionally included in the vector. The nucleotide sequence for 
the MHC polypeptide can encode a truncated MHC polypeptide, or an MHC polypeptide 
lacking a transmembrane domain, in addition to full-length MHC polypeptides and other 
constructs which arc derived from the full-length MHC polypeptide. The vector can be 

15 used to transform a prokaryotic cell such as E. coli. 

The invention additionally provides for a substantially pure MHC-peptide 
complex consisting essentially of an antigenic peptide and an isolated recombinant MHC 
component having altered glycosylation and an antigen binding site, wherein the antigenic 
peptide is associated with the antigen binding site. The peptide is typically between 

20 about 8 and about 30 amino acids, but can be shorter or longer. The peptide can be 
noncovalently associated with the antigen binding site. The present invention includes 
peptides which are autoantigenic and thereby associated with an autoimmune disease. An 
epitope on the peptide can be recognized, for example, by an autoreactive T cell 
associated with multiple sclerosis, rheumatoid arthritis, or myasthenia gravis. Suitable 

25 peptide include those comprising residues 138-167 of human AChR a subunit, residues 
84-102 of human MBP, and residues 148-162 of human MBP. 

Also provided are pharmaceutical compositions comprising a 
pharmaceutical^ acceptable carrier and the recombinant MHC-peptide complex. An 
example of the pharmaceutical composition includes a composition in which the MHC- 

30 peptide complex is embedded in a liposome. 
Definitions 

A "nucleotide sequence encoding an MHC polypeptide" is a subsequence 
or full length polynucleotide sequence which, when present in a cell, expresses an MHC 



WO 95/11702 



PCT/US94/12231 



4 

polypeptide. In the expression of recombinant constructs one of skill will recognize that 
the inserted polynucleotide sequence need not be identical and may be "substantially 
identical" to a sequence of the gene from which it was derived- As explained below, 
these variants are specifically covered by this term. 

In the case where the inserted polynucleotide sequence is transcribed and 
translated to produce a functional polypeptide, one of skill will recognize that because of 
codon degeneracy a number of polynucleotide sequences will encode the same 
polypeptide. These variants are specifically covered by the above term. In addition, 
reference to polynucleotides of the invention specifically includes those full length 
sequences substantially identical (determined as described below) with an MHC gene 
sequence and that encode proteins that retain the function of the MHC polypeptides. 
Thus, in the case of the sequences encoding MHC single subunits disclosed here, the 
term includes variant polynucleotide sequences which have substantial identity with the 
sequences disclosed here and which encode proteins capable of binding antigenic peptides 
and binding a T cell receptor. The polypeptides of the present invention can consist of a 
full length MHC subunits, or a fragment thereof. 

Two nucleic acid sequences or polypeptides are said to be "identical" if the 
sequence of nucleotides or amino acid residues, respectively, in the two sequences is the 
same when aligned for maximum correspondence as described below. The term 
"complementary to M is used herein to mean that the complementary sequence is identical 
to all or a portion of a reference polynucleotide sequence. 

Sequence comparisons between two (or more) polynucleotides or 
polypeptides are typically performed by comparing sequences of the two sequences over 
a "comparison window" to identify and compare local regions of sequence similarity. A 
"comparison window", as used herein, refers to a segment of at least about 20 
contiguous positions, usually about 50 to about 200, more usually about 100 to about 150 
in which a sequence may be compared to a reference sequence of the same number of 
contiguous positions after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman Adv. Appl Math. 2: 482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch J. MoL BioL 48:443 
(1970), by the search for similarity method of Pearson and Lipman Proc. NatL Acad. 



ScL (U.S.A.) 85: 2444 (1988), and by computerized implementations of these algorithms. 
Typically, the program providing the highest percentage identity is used. 

"Percentage of sequence identity" is determined by comparing two 
optimally aligned sequences over a comparison window, wherein the portion of the 
polynucleotide sequence in the comparison window may comprise additions or deletions 
(i.e., gaps) as compared to the reference sequence (which does not comprise additions or 
deletions) for optimal alignment of the two sequences. The percentage is calculated by 
determining the number of positions at which the identical nucleic acid base or amino 
acid residue occurs in both sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of positions in the window of 
comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. 

The term "substantial identity" of polynucleotide sequences means that a 
polynucleotide comprises a sequence that has at least 70% sequence identity, preferably 
at least 80%, more preferably at least 90% and most preferably at least 95%, compared 
to a reference sequence using the programs described above using standard parameters. 
One of skill will recognize that these values can be appropriately adjusted to determine 
corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the like. 
Substantial identity of amino acid sequences for these purposes normally means sequence 
identity of at least 70%, preferably at least 80%, more preferably at least 90%, and most 
preferably at least 95 % . 

Another indication that nucleotide sequences are substantially identical is if 
two molecules hybridize to each other under stringent conditions. Stringent conditions 
are sequence dependent and will be different in different circumstances. Generally, 
stringent conditions are selected to be about 5° C lower than the thermal melting point 
(Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength and pH) at which 50 % of the target sequence 
hybridizes to a perfectly matched probe. Typically, stringent conditions will be those in 
which the salt concentration is at least about 0.02 molar at pH 7 and the temperature is at 
least about 60°C. 

Another indication that protein sequences are substantially identical is if 
one protein is immunologically reactive with antibodies raised against the other protein. 
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Thus, the proteins of the invention include proteins immunologically reactive with 
antibodies raised against MHC polypeptides. 

As used herein, the terms "isolated", "substantially pure" and 
"substantially homogenous" are used to describe a protein that has been separated from 
5 components which naturally accompany it. Typically, a monomelic protein is S 
substantially pure when at least about 60 to 75% of a sample exhibits a single 
polypeptide backbone. Minor variants or chemical modifications typically share the same 
polypeptide sequence. A substantially purified protein will typically comprise over about 
85 to 90% of a protein sample, more usually about 95%, and preferably will be over 
10 about 99% pure, either by mass or by molecular numbers. Protein purity or 

homogeneity may be indicated by a number of means well known in the art, such as 
polyacrylamide gel electrophoresis of a protein sample, followed by visualizing a single 
polypeptide band on a polyacrylamide gel upon staining. For certain purposes high 
resolution will be needed and HPLC or a similar means utilized for purification. 

15 The term "MHC polypeptide" as used herein refers to a single chain MHC 

protein (e.g., the or or £ chain of Class II molecules or the heavy chain of Class I 
molecules) which may constitute all or part of the effective portion of the MHC complex 
(i.e., a peptide comprising an antigen binding site or sites and sequences necessary for 
recognition by the appropriate T cell receptor) which is in other than its native state, for 

20 example, not associated with the cell membrane of a cell that normally expresses MHC. 

The term "altered glycosylation" refers to glycosylation of the MHC 
polypeptide in which the polypeptide is unglycosylated, or has a glycosylation pattern 
which differs from that found on the native polypeptide. Altered glycosylation in this 
application refers to glycosylation achieved through in vivo processes, but does not refer 

25 to in vitro processes such as treatment of the MHC polypeptide with an enzyme or 
chemical to produce a deglycosylated molecule. 

An "unglycosylated recombinant MHC polypeptide" is an MHC class I or 
class H polypeptide that lacks substantially all naturally occurring glycosylation. 
Typically, the polypeptides of the invention will have less than about ten percent of the 

30 glycosylation observed when MHC polypeptides are produced in human cells. More 

♦ 

preferably, the polypeptides will have less than about five percent, and most preferably 
less than about one percent of the carbo-hydrates attached to MHC polypeptides produced 
in human cells. 



The unglycosylated recombinant MHC polypeptides of the invention are 
typically produced by a prokaryotic host cell that has been transformed with a nucleotide 
sequence that encodes the MHC polypeptide. Recombinant DNA techniques are 
generally used to link the MHC-encoding nucleotide sequence to signals that control gene 
expression. As a consequence of being produced in prokaryotic host cells, the MHC 
polypeptides lack the carbohydrate moieties that are normally found on MHC 
polypeptides from eukaryotic cells. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows kinetics of peptide binding to recombinant DR2 chains 
expressed in E. coli. 

Figure 2 shows the stability of recombinant DR2-peptide complexes. 

Figure 3 shows optimum pH for maximum binding of MBP peptids to 
purified recombinant DR2 polypeptide chains. 

Figure 4 shows ylFN production in T cells contacted with complexes of 

the invention. 

Figure 5 shows ylFN production in T cells contacted with complexes of 

the invention. 

Figure 6 shows efficacy of complexes of the invention in ah animal model 
for multiple sclerosis. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
The present invention provides recombinant MHC polypeptides that can be 
used to form complexes useful for modulating T cell function, and methods for 
producing the MHC polypeptides. The complexes, which consist of the MHC 
polypeptides complexed with antigenic peptides, can be used to inhibit a deleterious T 
cell-mediated immune response, such as allergic responses, allograft rejection, and 
autoimmune diseases. In addition, the complexes can be used to promote immune 
responses and can be used as vaccines. 

The present invention also provides methods for producing MHC proteins 
on a commercially viable scale* A further advantage provided by the present invention is 
that it provides a readily adaptable means of producing modified MHC polypeptides that 
are useful for a variety of desired uses. For example, when using the complexes for 
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promoting immune responses or vaccines, it is desirable to modify the MHC 
polypeptides to allow attachment to a competent antigen presenting cell bearing ligands 
involved in the costimulatory signal responsible for T cell activation. Alternatively, the 
MHC complex can be linked to isolated costimulatory ligands such that T cell 
proliferation is induced. Thus, T cells will respond to the antigenic peptide presented by 
the complexes and an immune response will be initiated. 

The unglycosylated MHC polypeptides of the invention are produced in 
prokaryotic cells, such as E. coli. The prokaiyotically-produced MHC polypeptides of 
the invention bind antigenic peptide with an efficiency similar to that of glycosylated, 
native MHC polypeptides. 

According to the invention, nucleotide sequences that code for the desired 
MHC polypeptides are isolated and transformed into suitable prokaryotic host cells, 
which are grown in culture under conditions that result in expression of the MHC 
polypeptides. The MHC polypeptides are then isolated from the cells or the culture 
supernatant and associated with the appropriate antigenic peptide to form a complex of 
the invention. Pharmaceutical compositions are prepared and administered according to 
standard techniques. For a general description of this approach, see U.S. Patent Nos. 
5,130,297 and 5,194,425. 
MHC Polypeptides 

The proteins encoded by the MHC have been extensively studied in both 
the human and murine systems. In general, they have been classified as Class I proteins, 
found on the surfaces of all cells and primarily recognized by cytotoxic T cells; and 
Class II proteins which are found on the surfaces of several cells, including accessory 
cells such as macrophages, and are involved in presentation of antigens to helper T cells. 
Some of the histocompatibility proteins have been isolated and characterized. For a 
general review of MHC protein structure and function, see Fundamental Immunology, 2d 
Ed., W.E. Paul, ed., Ravens Press N.Y, 1989. 

Several types of MHC complexes have been studied. The MHC 
complexes encoded by the murine I-A and I-E (class II) subregion* have been shown to 
consist of two noncovalently associated peptide chains: an alpha chain of 32-38 kd and a 
beta chain of 26-29 kd. A third, invariant, 31 kd peptide is noncovalently associated 
with these two peptides in the ceil and generally dissociates to allow for loading of the 
antigenic peptide. Surface expression of the invariant chain inhibits the ability of the 
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MHC class II chains to bind and present peptide. The alpha and beta chains of seven 
allelic variants of the I- A region have been cloned and sequenced (Estees et al, M T cell 
Clones 0 in Regulation of Immune Gene Expression, Feldman et al, eds. (Humana Press 
1985), pp. 3-19. Methods for purifying the murine I-A (Class II) histocompatibility 
5 proteins have been disclosed by Turkewitz, A.P., et aL, Molecular Immunology (1983) 
20: 1139-1147. These methods, which are also suitable for Class I molecules, involve 
preparation of a soluble membrane extract from cells containing the desired MHC 
molecule using nonionic detergents, such as NP-40, Tween 80 and the like. The MHC 
molecules are then purified by affinity chromatography, using a column containing 

10 antibodies raised against the desired MHC molecule. Use of 0.02% Tween-80 in the 
elution buffer is helpful to eliminate aggregation of the purified molecules. 

The human Class I proteins have also been studied . The MHC of humans 
(HLA) on chromosome 6 has three loci, HLA- A, HLA-B, and HLA-C, the first two of 
which have a large number of alleles encoding alloantigens. These are found to consist 

15 of a 44 kd subunit and a 12 kd beta 2 -microglobulin subunit which is common to all 

antigenic specificities. Isolation of these detergent-soluble HLA antigens was described 
by Springer, T.A., et al 9 Proe. Natl Acad. ScL USA (1976) 73: 2481-2485; 
Clementson, K.J., et ah, in "Membrane Proteins" Azzi, A., ed; Bjorkman, P., Ph.D. 
Thesis Harvard (1984). 

20 Further work has resulted in a detailed picture of the 3-D structure of 

HLA-A2, a Class I human antigen. (Bjorkman, PJ., et ah, Nature (1987) 329: 506- 
512, 512-518. In this picture, the /^-microglobulin protein and alpha 3 segment of the 
heavy chain are associated; the alphaj and alpha 2 regions of the heavy chain appear to 
form antigen-binding sites to which the peptide is bound (Science (1987) 238:613-614, 

25 Bjorkman, P.J. et al. Nature (supra). Soluble HLA-A2 can be purified after papain 

digestion of plasma membranes from the homozygous human lymphoblastoid cell line J- 
Y as described by Turner, MJ. et al, J. Biol Chem. (1977) 252: 7555-7567. Papain 
cleaves the 44 kd chain close to the transmembrane region yielding a molecule comprised 
of alpha u alpha 2 , alpha 3 , and /S 2 microglobulin. 

30 The three dimensional structure of human Class II MHC antigens has also 

been determined and is similar to that of of Class I molecules. Antigenic peptides are 
bound in an open ended antigen binding groove. The binding groove is formed from the 
N-terminal domain portions of two class II chains which extend from the membrane 
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bilayer. (Brown, et ah, Nature 364: 33-39 (1993)). Cloning of the Class II genes (as 
described by Estees, supra) permits manipulation of the Class II MHC binding domains 
for example, as described below. 
Cloning of MHC genes 
5 The amino acid sequence of each of a number of Class II proteins are 

known, and the genes or cDNAs have been cloned. Thus, these nucleic acids can be 
used to express the MHC polypeptides in a prokaryotic host cell according to the 
invention, as described herein. 

If a desired MHC gene or cDNA is not available, cloning methods known 
10 to those skilled in the art may be used to isolate the genes. One such method that can be 
used is to purify the desired MHC polypeptide, obtain a partial amino acid sequence, 
synthesize a nucleotide probe based on the amino acid sequence, and use the probe to 
identify clones that harbor the desired gene from a cDNA or genomic library. 

MHC polypeptides can be obtained by isolation from lymphocytes and 
15 screened for the ability to bind the desired peptide antigen. The lymphocytes are from 
the species of individual which will be treated with the complexes. For example, they 
may be isolated from human B cells from an individual suffering from the targeted 
autoimmune disease. The B cells can first be immortalized by transformation with a 
replication deficient Epstein-Barr virus, uti li zing techniques known in the art. 
20 MHC polypeptides have been isolated from a multiplicity of cells using a 

variety of techniques including solubilization by treatment with papain, by treatment with 
3M KC1, and by treatment with detergent. In a preferred method detergent extraction of 
Class II protein from lymphocytes followed by affinity purification is used. Detergent 
can then be removed by selected methods such as dialysis. Purification methods for 
25 MHC polypeptides are also discussed in the preceding section. 

After isolation of the enzyme, a partial amino acid sequence is determined 
and degenerate oligonucleotide probes, designed to hybridize to the desired gene, are 
synthesized. Amino acid sequencing is performed and oligonucleotide probes are 
synthesized according to standard techniques as described, for instance, in Sambrook et 
30 al 9 (1989) Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory. 

Genomic or cDNA libraries are prepared according to standard techniques 
as described, for instance, in Sambrook et al., supra. To construct genomic libraries, 
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large segments of genomic DNA are generated by random fragmentation and are ligated 
with vector DNA to form concatemers that can be packaged into the appropriate vector. 
Two kinds of vectors are commonly used for this purpose, bacteriophage lambda vectors 
and cosmids, 

5 To prepare cDNA, mRNA from the organism of interest is first isolated. 

Eukaryotic mRNA has at its 3' end a string of adenine nucleotide residues known as the 
poly-A tail. Short chains of oligo d-T nucleotides are then hybridized with the poly-A 
tails serving as a primer for the enzyme reverse transcriptase* This enzyme uses RNA as 
a template to synthesize a complementary DNA (cDNA) strand. A second DNA strand 
10 is then synthesized using the first cDNA strand as a template. Linkers are added to the 
double-stranded cDNA for insertion into a plasmid or X phage vector for propagation in 
E. coli. 

Identification of clones in either genomic or cDNA libraries harboring the 
desired nucleic acid segments is performed by either nucleic acid hybridization, or 

15 immunological detection of the encoded protein if an expression vector is used. The 
bacterial colonies are then replica plated on solid support, such as nitrocellulose filters. 
The cells are lysed and probed with either oligonucleotide probes described above or 
with antibodies to the desired protein. 

Other methods well known to those skilled in the art can also be used to 

20 identify desired genes. For example, amplification techniques, such as the polymerase 
chain reaction (PCR) can be used to amplify the desired nucleotide sequence. U.S. 
Patents Nos. 4,683,195 and 4,683,202 describe this method. Sequences amplified by 
PCR can be purified from agarose gels and cloned into an appropriate vector according 
to standard techniques. 

25 Prokaryoric E xpression of MHC Polypeptides 

Prokaryotes that arc useful as host cells, according to the present 
invention, most frequently are represented by various strains of E. coli. However, other 
microbial strains can also be used, such as bacilli, for example Bacillus subtilis, various 
species of Pseudomonas. or other bacterial strains. 

30 According to the invention, the MHC polypeptides are expressed from 

cloned nucleotide sequences that encode the MHC polypeptides by operably linking the 
truncated or full-length nucleic acids to signals that direct gene expression in 
prokaryotes. A nucleic acid is "operably linked" when it is placed into a functional 
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relationship with another nucleic acid sequence. For instance, a promoter or enhancer is 
operably linked to a coding sequence if it affects the transcription of the sequence. 
Generally, operably linked means that the nucleic acid sequences being linked are 
contiguous and, where necessary to join two protein coding regions, contiguous and in 
5 reading frame. 

The genes encoding the MHC molecules may be inserted into an 
"expression vector", "cloning vector", or "vector," terms which are used interchangeably 
herein and usually refer to plasmids or other nucleic acid molecules that are able to 
replicate in a chosen host cell. Expression vectors may replicate autonomously, or they 

10 can replicate by being inserted into the genome of the host cell, by methods well known 
in the art. Vectors that replicate autonomously will have an origin of replication or 
autonomous replicating sequence (ARS) that is functional in the chosen host cell(s). 

Plasmid vectors that contain replication sites and control sequences derived 
from a species compatible with the chosen host are used. For example, E. coli is 

15 typically transformed using derivatives of pBR322, a plasmid derived from an E. coli 
species by Bolivar et al. y Gene (1977) 2: 95* Often, it is desirable for a vector to be 
usable in more than one host cell, in E. coli for cloning and construction, and in a 
Bacillus cell for expression. 

The expression vectors typically contain a transcription unit or expression 

20 cassette that contains all the elements required for the expression of the DNA encoding 
the MHC molecule in the host cells, A typical expression cassette contains a promoter 
operably linked to the DNA sequence encoding a MHC polypeptide and a ribosome 
binding site. The promoter is preferably positioned about the same distance from the 
heterologous transcription start site as it is from the transcription start site in its natural 

25 setting. As is known in the art, however, some variation in this distance can be 

accommodated without loss of promoter function. In addition to a promoter sequence, 
the expression cassette can also contain a transcription termination region downstream of 
the structural gene to provide for efficient termination. The termination region may be 
obtained from the same gene as the promoter sequence or may be obtained from a 

30 different gene. 

Commonly used prokaryotic control sequences, which are defined herein 
to include promoters for transcription initiation, optionally with an operator, along with 
ribosome binding site sequences, include such commonly used promoters as the beta- 
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lactamase (penicillinase) and lactose (lac) promoter systems (Change et al. , Nature 
(1977) 198: 1056) and the tryptophan (trp) promoter system (Goeddel et aL, Nucleic 
Acids Res. (1980) 8: 4057) and the lambda-derived P L promoter and N-gene ribosome 
binding site (Shimatake et aL, Nature (1981) 292: 128). Any available promoter system 
that functions in prokaryotes can be used. 

Either constitutive or regulated promoters can be used in the present 
invention. Regulated promoters can be advantageous because the host cells can be grown 
to high densities before expression of the MHC polypeptides is induced. High level 
expression of heterologous proteins slows cell growth in some situations. Regulated 
promoters especially suitable for use in E. coli include the bacteriophage lambda P L 
promoter, the hybrid trp-lac promoter (Amann et al, Gene (1983) 25: 167; de Boer et 
al, Proc. Natl. Acad. ScL USA (1983) 80: 21, and the bacteriophage T7 promoter 
(Studier et al, J. Mol Biol. (1986); Tabor et al, (1985). These promoters and their 
use are discussed in Sambrook et at., supra. 

For expression of MHC polypeptides in prokaryotic cells other than E. 
coli, a promoter that functions in the particular prokaryotic species is required. Such 
promoters can be obtained from genes that have been cloned from the species, or 
heterologous promoters can be used. For example, the hybrid trp-lac promoter functions 
in Bacillus in addition to E. coli. 

A ribosome binding site (RBS) is also necessary for expression of MHC 
polypeptides in prokaryotes. An RBS in E. coli, for example, consists of a nucleotide 
sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the initiation 
codon (Shine and Dalgarno, Nature (1975) 254: 34; Steitz, In Biological regulation and 
development: Gene expression (ed, R.F. Goldberger), vol- 1, p. 349, 1979, Plenum 
Publishing, NY). 

Translational coupling may be used to enhance expression. The strategy 
uses a short upstream open reading frame derived from a highly expressed gene native to 
the translational system, which is placed downstream of the promoter, and a ribosome 
binding site followed after a few amino acid codons by a termination codon. Just prior 
to the termination codon is a second ribosome binding site, and following the termination 
codon is a start codon for the initiation of translation. The system dissolves secondary 
structure in the RNA, allowing for the efficient initiation of translation. See Squires, et. 
al. (1988), J. Biol. Chem. 263: 16297-16302. 
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The MHC polypeptides can be expressed intracellularly, or can be secreted 
from the cell. Intracellular expression often results in high yields. However, some of 
the protein may be in the form of insoluble inclusion bodies. Although intracellularly 
produced MHC polypeptides of the present invention are active upon being harvested 
5 following cell lysis, the amount of soluble, active MHC polypeptide may be increased by 
performing refolding procedures {see, e.g., Sambrook et aL, supra.; Marston et aL, 
Bio/Technology (1984) 2: 800; Schoner et aL, Bio/Technology (1985) 3: 151). More 
than one MHC polypeptide may be expressed in a single prokaryotic cell by placing 
multiple transcriptional cassettes in a single expression vector, or by utilizing different 
10 selectable markers for each of the expression vectors which are employed in the cloning 
strategy. 

A second approach for expressing the MHC polypeptides of the invention 
is to cause the polypeptides to be secreted from the cell, either into the periplasm or into 
the extracellular medium. The DNA sequence encoding the MHC polypeptide is linked 

15 to a cleavable signal peptide sequence. The signal sequence directs translocation of the 
MHC polypeptide through the cell membrane. An example of a suitable vector for use 
in E. coli that contains a promoter-signal sequence unit is pTA1529, which has the E. 
coli phoA promoter and signal sequence {see, e.g., Sambrook et aL, supra.; Oka et aL, 
Proc. NatL Acad. Sci. USA (1985) 82: 7212; Talmadge et aL, Proc. Natl. Acad. Sci. 

20 USA (1980) 77: 3988; Takahara et al., J. Biol. Chem. (1985) 260: 2670). Once again, 
multiple polypeptides can be expressed in a single cell for periplasmic association. 

The MHC polypeptides of the invention can also be produced as fusion 
proteins. This approach often results in high yields, because normal prokaryotic control 
sequences direct transcription and translation. In E. coli> lacZ fusions are often used to 

25 express heterologous proteins. Suitable vectors are readily available, such as the pUR, 
pEX, and pMRlOO scries {see, e.g., Sambrook et aL, supra.). For certain applications, 
it may be desirable to cleave the non-MHC amino acids from the fusion protein after 
purification. This can be accomplished by any of several methods known in the art, 
including cleavage by cyanogen bromide, a protease, or by Factor X. {see, e.g., 

30 Sambrook et aL, supra.; Itakura et aL, Science (1977) 198: 1056; Goeddel et aL, 
Proc. NatL Acad. Sci. USA (1979) 76: 106; Nagai et aL, Nature (1984) 309; 810; 
Sung et aL, Proc. NatL Acad. Sci. USA (1986) 83: 561). Cleavage sites can be 
engineered into the gene for the fusion protein at the desired point of cleavage. 
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A preferred system for obtaining recombinant proteins from E. coli which 
maintains the integrity of their N-termini has been described by Miller et al 
Biotechnology 7:698-704 (1989). In this system, the gene of interest is produced as a C- 
terminal fusion to the first 76 residues of the yeast ubiquitin gene contianing a peptidase 
5 cleavage site. Cleavage at the junction of the two moieties results in production of a 
protein having an intact authentic N-terminal reside. 

The vectors containing the nucleic acids that code for the MHC 
polypeptide are transformed into prokaryotic host cells for expression, "Transformation" 
rcfen to the introduction of vectors containing the nucleic acids of interest directly into 

10 host cells by well known methods. The particular procedure used to introduce the 
geneue material into the host cell for expression of the MHC polypeptide is not 
particularly critical. Any of the well known procedures for introducing foreign 
nucleotide sequences into host cells may be used. It is only necessary that the particular 
procedure utilized be capable of successfully introducing at least one gene into the host 

15 cell which is capable of expressing the gene. 

Transformation methods, which vary depending on the type of host cell, 
include elcctroporation; transfection employing calcium chloride, rubidium chloride 
calcium phosphate, or other substances; microprojectile bombardment; infection (where 
the vector is an infectious agent); and other methods. See, generally, Sambrook et al> 

20 (1989) supra, and Current Protocols in Molecular Biology , supra. Reference to cells 
into which the nucleic acids described above have been introduced is meant to also 
include the progeny of such cells. Transformed prokaryotic cells that contain expression 
vectors for expressing MHC polypeptides are also included in the invention. 

After standard transfection or transformation methods are used to produce 

25 prokaryotic cell lines that express large quantities of the MHC polypeptide, the 

polypeptide is then purified using standard techniques. See, e.g., Colley et al (1989) 
Biol Chem. 64: 17619-17622; and Methods in Enzymology, "Guide to Protein 
Purification", M. Deutscher, ed. Vol. 182 (1990). The recombinant cells are grown and 
the MHC polypeptide is expressed. The purification protocol will depend upon whether 

30 the MHC polypeptide is expressed intracellular^, into the periplasm, or secreted from 
the cell. For intracellular expression, the cells are harvested, lysed, and the MHC 
polypeptide is recovered from the cell lysate (Sambrook et a/., supra.). Periplasmic 
MHC polypeptide is released from the periplasm by standard techniques (Sambrook et 
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aL, supra,). If the MHC polypeptide is secreted from the cells, the culture medium is 
harvested for purification of the secreted protein* The medium is typically clarified by 
centrifugation or filtration to remove cells and cell debris. 

The MHC polypeptides can be concentrated by adsorption to any suitable 
5 resin such as, for example, CDP-Sepharose, Asialoprothrombin-Sepharose 4B, or Q 
Sepharose, or by use of ammonium sulfate fractionation, polyethylene glycol 
precipitation, or by ultrafiltration. Other means known in the art may be equally 
suitable. 

Further purification of the MHC polypeptides can be accomplished by 

10 standard techniques, for example, affinity chromatography, ion exchange 

chromatography, sizing chromatography, or other protein purification techniques used to 
obtain homogeneity. The purified proteins are then used to produce pharmaceutical 
compositions, as described below. 
Modified MHC Polypeptides 

15 The nucleotide sequences used to transfect the host cells can be modified 

according to standard techniques to yield MHC polypeptides with a variety of desired 
properties. The MHC polypeptides of the present invention can be readily designed and 
manufactured utilizing various recombinant DNA techniques. Many techniques are well 
known to those skilled in the art, and are provided in the cited references. For example, 

20 the MHC polypeptides can vary from the natui^y<Kx:urring sequence at the primary 
structure level by amino acid insertions, substitutions, deletions, and the like- Protein 
fusions may also be utilized that may confer new activities or combinations of activities 
on the MHC polypeptide. These modifications can be used in a number of combinations 
to produce the final modified MHC polypeptide chain. 

25 The amino acid sequence variants can be prepared with various objectives 

in mind, including facilitating purification and preparation of the recombinant 
polypeptide. The modified polypeptides are also useful for modifying therapeutic half 
life, improving therapeutic efficacy, and lessening the severity or occurrence of side 
effects during therapeutic use. The amino acid sequence variants are usually 

30 predetermined variants not found in nature but exhibit the same peptide-binding and T- 
cell binding activity as native-sequence MHC. For instance, polypeptide fragments 
comprising only a portion (usually at least about 60-80%, typically 90-95%) of the 
primary structure may be produced. In certain preferred embodiments, the MHC 
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polypeptides consist essentially of either the or (i x domain from the fiilHength 
polypeptide. Such fragments typically comprise between about 50 and about 100 amino 
acids, preferably between about 60 and about 90, more preferably between about 70 and 
about 80. Alternatively, synthetic methods may be used to prepare polypeptides. See, 
5 e.g., Merrifield (19&6) Science 232: 341-347; Atherton et ah, Solid Phase Peptide 
Synthesis: A Practical Approach, IRL Press, Oxford). 

In general, modifications of the sequences encoding the MHC polypeptides 
is readily accomplished by a variety of well-known techniques, such as site-directed 
mutagenesis {see, Gillman and Smith (1979) Gene 8: 81-97, and Roberts, S. et ah (1987) 

10 Nature 328: 731-734). Most modifications are evaluated by routine screening in a 
suitable assay for the desired characteristic. For instance, the effect of various 
modifications on the ability of the polypeptide to bind peptide or affect T-cell 
proliferation can be easily determined using the assays described below. Modifications 
of other properties such as redox or thermal stability, hydrophobicity, susceptibility to 

15 proteolysis, or the tendency to aggregate are all assayed according to standard 
techniques. 

For certain applications, the MHC cDNA coding sequences are modified 
to delete the transmembrane domain and express the resulting soluble MHC polypeptides. 
Truncation of the MHC cDNA may be performed, for example, by 

20 oligonucleotide-directed deletion mutagenesis or polymerase chain reaction. 

Oligonucleotide-directed in vitro mutagenesis is described, for example, by Kunkel et ah 
(1987) Meth. Enzymoh 154: 367-382. See also, Current Protocols in Molecular Biology , 
Ausubel et ah, eds., Greene Publishing and Wiley-Interscience, New York (1987 and 
periodic supplements). 

25 Pharmaceutical Use of MHC Polypeptides 

The unglycosylated, prokaryotically-expressed MHC polypeptides of the 
invention can be used to form complexes with a peptide that represents an antigen 
associated with, for example, autoimmunity, allograft rejection or allergic responses. 
The components of the complex are chosen to have a desired effect on the immune 

30 system. An effective portion of an MHC polypeptide is one that comprises the antigen 
binding sites and sequences necessary for recognition of the MHC-peptide complex by 
the appropriate T cell receptor. The MHC component can be either a Class I or a Class 
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II molecule. The association between the peptide antigen and the antigen binding sites of 
the MHC protein can be by covalent or by noncovalent binding. 

In other embodiments the complexes may also contain an effector 
component which is generally a toxin or a label. The effector portion may be conjugated 
5 to either the MHC-encoded protein or to the autoantigenic peptide. Production and use 
of complexes are disclosed in U.S. Patent No. 5,130,297, supra. 
Peptide Antigen? 

The antigenic peptides used in the complexes of the invention are are at 
least about 8 residues in length, usually at least about 10 residues, more usually at least 

10 about 12. Usually, the maximum length is about 30 residues, more usually about 25, 
and often less than 20. The length of peptides capable of binding an MHC molecule, 
however, can vary. Thus, peptides of greater length, e.g., up to 100 residues can also 
be used in the complexes. Usually, the peptides will be less than about 50 residues in 
length, preferably less than about 30. 

15 The antigenic proteins or tissues for a number of immunopathologies are 

known. For example, the complexes can be used to treat allergic responses. Examples 
of such conditions include food hypersensitivities such as celiac disease and crohn disease 
and allergic responses to ragweed, dust mites, cats, honey bee venom, and grass pollen. 
For a review of allergic diseases suitable for treatment using the methods of the present 

20 invention see, CTHehir, et al Ann. Rev. Immunol., 9:67-95 (1991). 

In experimentally induced autoimmune diseases, antigens involved in 
pathogenesis have been characterized: in arthritis in rat and mouse, native type-II 
collagen is identified in collagen-induced arthritis, and mycobacterial heat shock protein 
in adjuvant arthritis (Stuart et al. (1984), Ann. Rev. Immunol. 2: 199-218; van Eden et 

25 al (1988), Nature 331: 171-173); thyroglobulin has been identified in experimental 

allergic thyroiditis (EAT) in mouse (Maron et al (1988), J. Exp. Med. 152: 1115-1120); 
acetyl choline receptor (AChR) in experimental allergic myasthenia gravis (EAMG) 
(Lindstrom et al (1988), Adv. Immunol 42: 233-284); and myelin basic protein (MBP) 
and proteolipid protein (PLP) in experimental allergic encephalomyelitis (EAE) in mouse 

30 and rat (See Acha-Orbea et al (1989) Ann. Rev. Immunol 7: 377-405). In addition, for 
example, target antigens have been identified in humans: type-n collagen in human 
rheumatoid arthritis (Holoshitz et al (1986) Lancet ii: 305-309); and acetyl choline 
receptor in myasthenia gravis (Lindstrom et al (1988) supra). 
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It is believed that the presentation of antigen by the MHC glycoprotein on 
the surface of antigen-presenting cells (APCs) occurs subsequent to the hydrolysis of 
antigenic proteins into smaller peptide units. The location of these smaller segments 
within the antigenic protein can be determined empirically. These segments are thought 
5 to be 8-18 residues in length, and contain both the agretope (recognized by the MHC 
molecule) and the epitope (recognized by T cell receptor on the T-helper cell). The 
epitope itself is a contiguous or non-contiguous sequence of 5-6 amino acids which 
recognizes the antigen-specific receptor of T-helper cells. The agretope is a continuous 
or non-contiguous sequence which is responsible for the association of the peptide with 

10 the MHC proteins. 

The empirical process of determining the relevant 8-18 amino acid 
subunits is illustrated using the alpha subunit of the acetylcholine receptor of skeletal 
muscle. In myasthenia gravis (MG) an autoimmune response is directed to a region of 
this subunit. A loss of the acetyl choline receptors on the postsynaptic membrane of the 

15 neuromuscular junction causes the MG symptoms. 

In MG, autoantibodies against the alpha subunit of the acetylcholine 
receptor (AChR) are associated with the autoimmune response directed at the AChR. 
Eighty five percent of MG patients have autoantibodies reactive with the alpha subunit. 
Of these, 60% have antibodies that bind to a peptide segment of the alpha subunit called 

20 the main immunogenic region (MIR) which is located between residues 60 and 80 
(Tzartos and Lindstrom, Proc. Natl Acad. ScL USA (1980) 77: 755). The peptide 
segments recognized by autoreactive human T cells also are located on the alpha subunit 
(Hohlfeld et aL, Proc. Natl. Acad. ScL USA (1987) 84: 5379-5383. The epitopes 
recognized by these T cells lie between residues 1-30, 125-147, 169-181, 257-271 and 

25 351-368. In addition, in humans the AChR peptides 195-212 and 257-269 have been 
partially characterized as epitopes in myasthenia gravis patients of the HLA-DR5 and 
HLA-DR3, DQw2 MHC haplotypes, respectively (see Acha-Orbea (1989) supra). 

The peptides carrying agretopes permitting presentation of the epitopes 
associated with alpha subunit of this receptor are readily determined. For example, 

30 determination of the appropriate peptides in a mouse model is carried out as follows. 

Strains of mice which, when immunized with Torpedo californicus AChR 
develop a disease with many of the features of human myasthenia gravis, are used as 
model. MHC Class II glycoproteins are isolated from spleen cells of mice of this strain 
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using lectin and monoclonal antibody affinity supports. The purified MHC Class II 
proteins are incorporated into phospholipid vesicles by detergent dialysis. The resultant 
vesicles are then allowed to fuse to clean glass cover slips to produce on each a planar 
lipid bilayer containing MHC molecules (Brian and McConnell, Proc. Natl Acad. Set 
5 USA (1984) 81: 6159. 

One cover slip containing MHC Class II molecules embedded in the 
adherent planar lipid membrane is placed in each well of several 24-well culture plates. 
Each one of the approximately 40 overlapping 20-residue synthetic peptides 
corresponding to the alpha subunit sequence and containing one or more radiolabeled 

10 amino acid residues (prepared as described below) is placed in a well with cover slip and 
PBS and allowed to incubate several days. The extent of binding of peptide in the MHC 
Class II glycoprotein antigen binding site is measured by the amount of radio-activity 
incorporated into the MHC Class II-planar lipid membrane on the cover slip versus 
planar lipid membrane alone. Specific incorporation of radioactivity indicates that the 

15 bound peptide contains an agretope (MHC Class II peptide binding site) of one of the 

several species of MHC Class II molecules present in the planar lipid membrane. In this 
way, the set of agretopes for the alpha subunit of AChR is defined for the mouse strain 
that displays the symptoms of MG upon immunization with AChR or purified alpha 
subunit. 

20 Next, each of the alpha subunit synthetic peptide segments that contain an 

agretope is again incorporated into the antigen binding site of isolated MHC Class II 
proteins embedded in planar lipid membranes on cover slips. One cover slip is added to 
each well of a 24-well culture plate, and spleen cells from mice immunized against 
AChR (and from which strain the adherent MHC Class II proteins were isolated) are 

25 added to each well. T cell hybridoma proliferation, as measured by tritiated thymidine 

uptake into DNA, indicates that the MHC Class II protein-bound peptide contains both an 
agretope and an epitope for binding to the T cell. Activation of T cell clones is 
determined by measuring XL-3 production (see, Quill et al. 9 supra). 

The Dupont apparatus and technique for rapid multiple peptide synthesis 

30 (RAMPS) is used to synthesize the members of a set of overlapping (10 residue overlap), 
20-residue peptides from the alpha subunit of Torpedo californicus AChR. One or more 
radioactive amino acids is incorporated into each synthetic peptide. The pentafluorphenyl 
active esters of side chain-protected, FMOC amino acids are used to synthesize the 
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peptides, applying standard stepwise solid phase peptide synthetic methods, followed by 
standard side chain deprotection and simultaneous release of the peptide amide from the 
solid support. 

Alternatively the overlapping sequences which include the putative 
5 segments of 8-18 amino acids of the antigenic protein, such as acetylcholine receptor 
protein, can be synthesized on the method of Geysen, H.M., et ah J. Immun. Meth. 
(1987) 102: 274. The synthesized radiolabeled peptides are tested by incubating them 
individually (on the plates) with purified MHC proteins that have been formulated into 
lipid membrane bilayers as above. 

10 In multiple sclerosis (MS), which results in the destruction of the myelin 

sheath in the central nervous system, myelin basic protein (MBP), the major protein 
component of myelin is the principal autoantigen. Pertinent segments of the MBP 
protein are also determined empirically, using a strain of mice which develops 
experimental allergic encephalitis (EAG) when immunized with bovine myelin basic 

15 protein. 

Systemic lupus erythematosus (SLE) has a complex symptomology, but 
results from an autoimmune response to red blood cells. Peptides which are the 
antigenic effectors of this disease are found in the proteins on the surface of red blood 
cells. 

20 Rheumatoid arthritis (RA) is a chronic inflammatory disease resulting from 

an immune response to proteins found in the synovial fluid. 

Insulin-dependent diabetes mellitus (IDDM) results from autoimmune 

attack on the beta cells within the Islets of Langerhans which are responsible for 

secretion of insulin. Circulating antibodies to Islets cells surface antigens and to insulin 
25 are known to precede IDDM. Critical peptides in eliciting the immune response in 

IDDM are believed to be portions of the insulin sequence and the beta cell membrane 

surface proteins. 

The relevant antigenic peptide subunits, as they are relatively short, can 
readily be synthesized using standard automated methods for peptide synthesis. In the 
30 alternative, they can be made recombinantly using isolated or synthetic DNA sequences, 

although this is not the most efficient approach for peptides of this length. :| 
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Thus, in summary, a set of labeled test peptides is prepared, and those 
which bind to MHC in planar lipid membranes containing MHC proteins are shown to 
contain the agretope. 

The identified peptides are then prepared by conventional solid phase 
5 synthesis and the subset which contain epitopes for the disease-inducing helper T cell 
clones is determined by incubation of the candidate peptides with murine antigen- 
presenting cells (APC) (or with isolated MHC complex) and spleen or lymph node 
T cells from mice immunized with the full length protein. Successful candidates will 
stimulate T cell proliferation in this system. This second, smaller, subset represents the 
10 suitable peptide component. 
Formation of the Complex 

The elements of the complex can be associated by standard means known 
in the art, as described in U.S. Patent No. 5,130,297, supra. The antigenic peptides can 
be associated noncovalently with the pocket portion of the MHC protein by, for example, 
15 mixing the two components. Excess peptide can be removed by any of a number of 
standard procedures, such as ultrafiltration or dialysis. The peptides can also be 
covalently bound using standard procedures by, for example, photo affinity labelling, 
(see, e.g., Hall et aL, Biochemistry 24: 5702-5711 (1985). Alternatively, the peptide 
can be covalently bound to the MHC component by expressing the peptide and MHC 
20 component from a single polynucleotide sequence. For example, the peptide can be 

covalently attached to the MHC component through a flexible peptide linker (see, e.g. , 
Kozono et al Nature 369:151-154 (1994)). 
Assessment pf the Complex 

The complexes formed using MHC polypeptides of the invention can be 
25 assayed using an in vitro system or using an in vivo model. In the in vitro system, the 
complex is incubated with peripheral blood T cells from subjects immunized with, or 
showing immunity to, the protein or antigen responsible for the condition associated with 
the peptide of the complex. The successful complexes will induce anergy in syngeneic T 
cells and prevent proliferation of the T cells even upon stimulation with additional 
30 antigen. 

In the in vivo system, T cells that proliferate in response to the isolated 
epitope or to the full length antigen in the presence of APC are cloned. The clones are 
injected into histocompatible animals that have not been immunized, in order to induce 
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the autoimmune disease. The relevant complex should, ameliorate or eliminate the 
symptoms of the disease. 

Either of the types of complexes, i.e., with or without the effector 
component, may be used. In one mode the treatment is two-fold. The individual is 
5 treated with the complex of prokaryotically-expressed, MHC-encoded antigen-presenting 
protein containing an effective portion of the antigenic peptide to down-regulate the 
immune system. Further down-regulation is achieved by treatment with the three 
component complex with includes the prokaryotically-expressed, MHC -encoded antigen- 
presenting protein, an effective portion of antigenic peptide which is specific for the 

10 autoimmune disease being treated, and an effector component. In addition, panels of 

complexes may be used for treatment. For example, if it is suspected that more than one 
peptide of an anugen is involved in the autoimmune response, and/or if it is suspected 
that more than one anugen is involved, the individual may be treated with several 
complexes selected from a panel containing the effective portion of the appropriate 

15 prokaryotically-expressed, MHC-encoded antigen-presenting polypeptides, and effective 
portions of antigenic peptides; these may be with or without effector components. 

Administration of a labeled complex permits identification of those 
portions of the immune system involved in the disease, in diagnostic applications. 
Selection of the MHC Complexes for Therapy and/or Diagnosis 

20 In order to select the MHC complexes that are to be used in the diagnosis 

or treatment of an individual for a particular disease, the type of MHC antigens that are 
involved in the presentation of the antigen are identified. The following discussion 
describes the identification of antigen associated with autoimmune disease, but one of 
skill will recognize that the same general approach can be used for other diseases, such 

25 as allergies. 

Specific autoimmune dysfunctions are correlated with specific MHC types. 
Methods for identifying which alleles, and subsequently which MHC encoded 
polypeptides, are associated with an autoimmune disease are known in the art. A method 
described in EP 286447 is suitable. In this method several steps are followed. First, the 
30 association between an MHC antigen and the autoimmune disease is determined based 
upon genetic studies. The methods for carrying out these studies are known to those 
skilled in the art, and information on all known HLA disease associations in humans is 
maintained in the HLA and Disease Registry in Copenhagen. The locus encoding the 
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polypeptide associated with the disease is the one that would bear the strongest 
association with the disease. 

Second, specific alleles encoding the disease associated with MHC 
antigen/polypeptide are identified. In the identification of the alleles, it is assumed that 
5 the susceptibility allele is dominant. Identification of the allele is accomplished by 

determining the strong positive association of a specific subtype with the disease. This 
may be accomplished in a number of ways, all of which are known to those skilled in the 
art. For example, subtyping may be accomplished by mixed lymphocyte response 
(MLR) typing and by primed lymphocyte testing (PUT). Both methods are described in 

10 Weir and Blackwell, eds., Handbook of Experimental Immunology. It may also be 

accomplished by analyzing DNA restriction fragment length polymorphism (RFLP) using 
DNA probes that are specific for the MHC locus being examined. E.g., Nepom (1986) 
Annals N. Y, Acad. Sci. 475: 1. Methods for preparing probes for the MHC loci are 
known to those skilled in the art. See, e.g., Gregersen et al. (1986), Proc. Natl Acad. 

15 Sci. USA 79: 5966; Weissman et al. in Medicine in Transition: the Centennial of the 
University of Illinois College of Medicine (E.P. Cohen, ed., 1981). 

The most complete identification of subtypes conferring disease 
susceptibility is accomplished by sequencing of genomic DNA of the locus, or cDNA 
copies of mRNA transcribed within the locus. The DNA that is sequenced includes the 

20 section encoding the hypervariable regions of the MHC encoded polypeptide. 

Techniques for identifying specifically desired DNA with a probe, for amplification of 
the desired region are known in the art, and include, for example, the polymerase chain 
reaction (PCR) technique. 

Once the allele that confers susceptibility to the specific autoimmune 

25 disease is identified, the polypeptide encoded within the allele is also identifiable, i.e., 
the polypeptide sequence can be deduced from the sequence of DNA within the allele 
encoding it. The MHC antigen complexes of the invention used for diagnosis and/or 
therapy are derived from the effective portion of the MHC antigen associated with the 
autoimmune disease state and from an autoimmune antigen associated with the same 

30 disease state. 

As an example, over 90% of rheumatoid arthritis patients have a haplotype 
of DR4(Dw4), DR4(Dwl4) or DR1. It is also known that a target antigen in human 
rheumatoid arthritis is type-II collagen. Hence, the complexes of the invention used for 
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treatment or diagnosis of an individual with rheumatoid arthritis would include those 
containing a polypeptide derived from the DR4(Dw4), DR1 and/or DR4(Dwl4) which is 
capable of antigen presentation for disease induction, or incapable of antigen presentation 
for disease suppression, compiexed with an effective portion of type-II collagen. 
5 As used herein, the term "individual" encompasses all mammals and all 

vertebrates which possess basically equivalent MHC systems. 

Formulation and Administration 
If the transmembrane region of the MHC subunit is included, the 
complexes formed using prokaryotically-expressed MHC polypeptides of the invention 

10 are conveniently administered after being incorporated into lipid monolayers or bilayers. 
Typically liposomes are used for this purpose but any form of lipid membrane, such as 
planar lipid membranes or the cell membrane of a cell {e.g., a red blood cell) may be 
used. The complexes are also conveniently incorporated into micelles. The data 
presented in Example 2, below, shows that MHC-peptide complexes comprising dimeric 

15 MHC molecules exist primarily as aggregates. 

Liposomes can be prepared according to standard methods, as described 
below. However, if the transmembrane region is deleted, the complex can be 
administered in a manner conventionally used for peptide-containing pharmaceuticals. 

Administration is systemic and is effected by injection, preferably 

20 intravenous, thus formulations compatible with the injection route of administration may 
be used. Suitable formulations are found in Remington's Pharmaceutical Sciences, Mack 
Publishing Company, Philadelphia, PA, 17th ed. (1985). A variety of pharmaceutical 
compositions comprising complexes of the present invention and pharmaceutical^ 
effective carriers can be prepared. The pharmaceutical compositions are suitable in a 

25 variety of drug delivery systems. For a brief review of present methods of drug 
delivery, see, Langer, Science 249: 1527-1533 (1990). 

In preparing pharmaceutical compositions using the prokaryotically- 
expressed, unglycosylated MHC polypeptides of the present invention, it is frequently 
desirable to modify the complexes of the present invention to alter their pharmacokinetics 

30 and biodistribution. For a general discussion of pharmacokinetics, see, Remington's 
Pharmaceutical Sciences, supra, Chapters 37-39. A number of methods for altering 
pharmacokinetics and biodistribution are known to one of ordinary skill in the art (see, 
e.g., Langer, supra). For example, conjugation to soluble macromolecules, such as 
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proteins, polysaccharides, or synthetic polymers, such as polyethylene glycol, is 
effective. Other methods include protection of the complexes in vesicles composed of 
substances such as proteins, lipids (for example, liposomes), carbohydrates, or synthetic 
polymers. 

5 Liposomes of the present invention typically contain the MHC -peptide 

complexes positioned on the surface of the liposome in such a manner that the complexes 
are available for interaction with the T cell receptor. The transmembrane region is 
usually first incorporated into the membrane at the time of forming the membrane. The 
liposomes can be used to target desired drugs (e.g., toxins or chemotherapeutic agents) to 

10 particular autoreactive T cells. Alternatively, the complexes embedded in the liposome 
may be used to induce anergy in the targeted cells. 

A variety of methods are available for preparing liposomes, as described 
in, e.g., Szoka et a!., Ann. Rev. Biophys. Bioeng. 9: 467 (1980), U.S. Pat. Nos. 4, 
235,871, 4,501,728 and 4,837,028. 

15 Micelles are also commonly used in the art to increase solubility of 

molecules having nonpolar regions. One of skill will thus recognize that micelles are 
useful in compositions of the present invention. Micelles comprising the complexes of 
the invention are prepared according to methods well known in the art (see, e.g., 
Remington's Pharmaceutical Sciences, supra, Chap. 20). Micelles comprising the 

20 complexes of the present invention are typically prepared using standard surfactants or 
detergents. 

Common surfactants well known to one of skill in the art can be used in 
the micelles of the present invention. Suitable surfactants include sodium laureate, 
sodium oleate, sodium lauryl sulfate, octaoxyethylene glycol monododecyl ether, 

25 octoxynol 9 and PLURONIC F-12T(Wyandotte Chemicals Corp.). Preferred surfactants 
are nonionic polyoxyethylene and polyoxypropylene detergents compatible with IV 
injection such as, TWEEN-80*, PLURONIC F~68 # , n-octyl-/S-D-giucopyranoside, and the 
like. In addition, phospholipids, such as those described for use in the production of 
liposomes, may also be used for micelle formation. 

30 Since the MHC subunits of the present invention comprise a lipophilic 

transmembrane region and a relatively hydrophilic extracellular domain, mixed micelles 
are formed in the presence of common surfactants or phospholipids and the subunits. 
The mixed micelles of the present invention may comprise any combination of the 
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subunits, phospholipids and/or surfactants. Thus, the micelles may comprise subunits 
and detergent, subunits in combination with both phospholipids and detergent, or subunits 
and phospholipid. 

For pharmaceutical compositions which comprise the complexes of the 
5 present invention, the dose will vary according to, e.g., the particular complex, the 

manner of administration, the particular disease being treated and its severity, the overall 
health and condition of the patient, and the judgment of the prescribing physician. 
Dosage levels for murine subjects are generally between about 10 fig and about 500 fig. 
A total dose of between about 50 fig and about 300 fig, is preferred. For instance, in 

10 treatments provided over the course of a disease, three 25 fig or 100 fig doses are 
effective. Total dosages range between about 0.015 and about 15 fig/kg, preferably 
about 0.15 to about 10 fig/kg. 

The pharmaceutical compositions are intended for parenteral, topical, oral 
or local administration, such as by aerosol or transdermally, for prophylactic and/or 

15 therapeutic treatment. The pharmaceutical compositions can be administered in a variety 
of unit dosage forms depending upon the method of administration. For example, unit 
dosage forms suitable for oral administration include powder, tablets, pills, and capsules. 

Preferably, the pharmaceutical compositions are administered 
intravenously. Thus, this invention provides compositions for intravenous administration 

20 which comprise a solution of the complex dissolved or suspended in an acceptable 

carrier, preferably an aqueous carrier, A variety of aqueous carriers may be used, e.g., 
water, buffered water, 0.4% saline, and the like. For instance, phosphate buffered saline 
(PBS) is particularly suitable for administration of soluble complexes of the present 
invention. A preferred formulation is PBS containing 0.02% TWEEN-80. These 

25 compositions may be sterilized by conventional, well-known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, 
or lyophilized, the lyophilized preparation being combined with a sterile aqueous solution 
prior to administration. The compositions may contain pharmaceutically acceptable 
auxiliary substances as required to approximate physiological conditions, such as pH 

30 adjusting and buffering agents, tonicity adjusting agents, wetting agents and the like, for 
example, sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium 
chloride, sorbitan monolaurate, triethanolamine oleate, etc. 
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The concentration of the complex can vary widely, L'e. t from less than 
about 0.05%, usually at or at least about 1 % to as much as 10 to 30% by weight and 
will be selected primarily by fluid volumes, viscosities, etc., in accordance with the 
particular mode of administration selected. Preferred concentrations for intravenous 
5 administration are about 0.02% to about 0. 1 % or more in PBS. 

For solid compositions, conventional nontoxic solid carriers may be used 
which include, for example, pharmaceutical grades of mannitol, lactose, starch, 
magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic 

10 composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient. 

For aerosol administration, the complexes are preferably supplied in finely 
divided form along with a surfactant and propellant. The surfactant must, of course, be 
nontoxic, and preferably soluble in the propellant. Representative of such agents are the 

15 esters or partial esters of fatty acids containing from 6 to 22 carbon atoms, such as 

caproic, octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids 
with an aliphatic polyhydric alcohol or its cyclic anhydride such as, for example, 
ethylene glycol, glycerol, erythritol, arabitol, mannitol, sorbitol, the hexitol anhydrides 
derived from sorbitol, and the polyoxyethylene and polyoxypropylene derivatives of these 

20 esters. Mixed esters, such as mixed or natural glycerides may be employed. The 

surfactant may constitute 0.1%-20% by weight of the composition, preferably 0,25-5%. 
The balance of the composition is ordinarily propellant. Liquefied propellants are 
typically gases at ambient conditions, and are condensed under pressure. Among suitable 
liquefied propellants are the lower alkanes containing up to 5 carbons, such as butane and 

25 propane; and preferably fluorinated or fluorochlorinated alkanes. Mixtures of the above 
may also be employed. In producing the aerosol, a container equipped with a suitable 
valve is filled with the appropriate propellant, containing the finely divided compounds 
and surfactant. The ingredients are thus maintained at an elevated pressure until released 
by action of the valve. 

30 The compositions containing the complexes can be administered for 

therapeutic, prophylactic, or diagnostic applications. In therapeutic applications, 
compositions are administered to a patient already suffering from a disease, as described 
above, in an amount sufficient to cure or at least partially arrest the symptoms of the 
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disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on the 
severity of the disease and the weight and general state of the patient. As discussed 
above, this will typically be between about 0.5 mg/kg and about 25 mg/kg, preferably 
5 about 3 to about 15 mg/kg. 

In prophylactic applications, the complexes of the invention are 
administered to a patient susceptible to or otherwise at risk of a particular disease. Such 
an amount is defined to be a "pfophylactically effective dose." In this use, the precise 
amounts again depend on the patient's state of health and weight. The doses will 

10 generally be in the ranges set forth above. 

In diagnostic applications, compositions containing the appropriate 
complexes or a cocktail thereof are administered to a patient suspected of having an 
autoimmune disease state to determine the presence of autoreactive T cells associated 
with the disease. Alternatively, the efficacy of a particular treatment can be monitored. 

15 An amount sufficient to accomplish this is defined to be a "diagnostically effective dose." 
In this use, the precise amounts will depend upon the patient's state of health and the 
like, but generally range from 0.01 to 1000 mg per dose, especially about 10 to about 
100 mg per patient. 

Kits can also be supplied for therapeutic or diagnostic uses. Thus, the 

20 complexes of the present invention may be provided, usually in a lyophilized form in a 
container. The complexes, which may be conjugated to a label or toxin, or 
unconjugated, are included in the kits with buffers, such as Tris, phosphate, carbonate, 
etc., stabilizers, biocides, inert proteins, e.g., serum albumin, or the like, and a set of 
instructions for use. Generally, these materials will be present in less than about 5% wt. 

25 based on the amount of complex and usually present in total amount of at least about 

0.001% wt. based again on the protein concentration. Frequently, it will be desirable to 
include an inert extender or excipient to dilute the active ingredients, where the excipient 
may be present in from about 1 to 99% wt. of the total composition. Where an antibody 
capable of binding to the complex is employed in an assay, this will usually be present in 

30 a separate vial. The antibody is typically conjugated to a label and formulated according 
to techniques well known in the art. 

Unless defined otherwise, all technical and scientific terms used herein 
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have the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Although any methods and materials similar or equivalent 
to those described can be used in the practice or testing of the present invention , the 
preferred methods and materials are now described. 

EXAMPLE 1 

Construction of Bacterial Expression Vector for MHC class II 

Strategy . MHC class II molecules were expressed in E. coli using an 
expression vector derived from that described by Squires et aL (J. BioL Chem. (1988) 
263: 16297-16302). The MHC gene was inserted into a modified expression vector that 
contains a 17 promoter to drive expression of the inserted gene. Both full-length and 
truncated <ATM) MHC genes were expressed* The nucleotide sequences coding for the 
transmembrane and cytoplasmically exposed regions of the MHC polypeptides were 
deleted in the ATM constructs (See SEQ. ID. Nos. 11 and 13). SEQ. ID. No. 11 shows 
the DNA sequence corresponding to the mature full-length form of the HLA DR2-Dw2 
or-chain and SEQ, ID. No. 13 shows the HLA DR2-Dw2 fr-chain. The ATM constructs 
were made by deleting the region between positions 577 and 690 of the a-chain and the 
region between positions 595 and 714 in the /S-chain respectively. 

Reagents and Materials . Oligonucleotides were synthesized on an Applied 
Biosystems 392 DNA synthesizer using 0-cyanoethyl phosphoramidite chemistry and 
were purified using Applied Biosystems OPC cartridges per manufacturers instructions. 
Plasmids pET3a and pETllb, which contain the bacteriophage T7 promoter, were 
purchased from Novagen. Plasmid pUC19 and all restriction enzymes and DNA 
modifying enzymes were purchased from New England Biolabs. 

£. coli K-12 strain W3110 was obtained from the ATCC. The cell line 
GMO3107 (source of the MHC sequences) was obtained from the National Institute of 
General Medical Sciences (NIGMS) repository at the Coriell Institute for Medical 
Research. GMO3107 is an EBV transformed human cell line that expresses high levels 
of the DR2-Dw2 heterodimer complex on its surface. 

Cpnstruction of an E. coli expression plasmid . Expression plasmids for 
MHC class II molecules were constructed from pET3a as follows. 
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pET3a was digested with £coRI, blunt-ended with DNA Polymerase I 
(Klenow fragment) and digested with EcoRV. The vector was recircularized, destroying 
both restriction sites, to generate plasmid p26404. 

Plasmid p26405 was derived from p26404 as follows. First, p26404 was 
5 digested with BamHl and the ends were filled-in with DNA polymerase I (Klenow) to 
generate blunt ends. A synthetic linker of sequence: 5\..CGGAATTCCG...3 T (SEQ. 
ID. No. 15) was introduced into the destroyed BamHl site, thus replacing it with a new 
EcoRI site. 

Plasmid p26411 was generated by digestion of p26405 with Ndel and 
10 EcoRI and insertion of a synthetic linker sequence: 

5\..TATG GCT AGC ATG ACT GGT GGA CAG CAA ATG GGT- 
3 '...AC CGA TCG TAC TGA CCA CCT GTC GTT TAC CCA- 

-CGG GAT CCG AAGCTT AG.. .3' (SEQ. ID. No. 1) 
15 -GCC CTA GGC TTCGAA TCTAA...5' (SEQ. ID. No.2) 

This linker provides the first 14 codons of the phi-10 open reading frame 

(coupler), and has a BamHl site located in the proper reading frame for subsequent 

expression of an inserted MHC gene. Downstream of the BamHl site is a Hwdin 

cloning site that together with the EcoRI site provides two downstream sites for 

20 directional cloning of inserts* 

To facilitate cloning, one of the BamHl sites in p26411 was destroyed to 

generate plasmid p27305. p26411 was digested with EcoRI + Pstl and the 3387bp 

fragment recovered. p26411 was also digested with BamHl and Pstl and the 891bp 

fragment was recovered. A synthetic linker of sequence: 

25 5\..AATTCCTACGTA...3'(SEQ. ID. No. 3) 

3\..GGATGCATCTAG...5'(SEQ. m. No. 4) 

was made and ligated with the two fragments of p26411 to generate p27305. The linker 

has both EcoRI and BamHl cohesive ends, but will regenerate only the EcoRI site upon 

ligation. Additionally, it carries a SnaBl (blunt) cloning site for future downstream 

30 manipulations. 

Plasmid p27313 incorporates a copy of the lacl q repressor protein onto 
p27305 to control unwanted transcription of the target gene prior to induction. p27305 
was digested with BamHl and Pstl and a 917 bp fragment recovered. Plasmid pETllb 
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was similarly treated and a 4608 bp fragment recovered. Ligation of the two fragments 
yielded the final form of the expression plasmid, p27313. 

Cloning of DR2-Dw2 Alpha and Beta chain genes . Poly-A+ mRNA was 
prepared from 5.0 x 10 7 viable GMO3107 cells using a Fast Track kit (Invitrogen), 
5 following the manufacturer's instructions. 

cDNA was prepared from 25ng of poly-A+ mRNA using a Clontech first- 
strand cDNA synthesis kit, following the manufacturer's instructions. 

Sequence information for the human HLA DR2-Dw2 a (dra) (Lee et aL 
(1987) Proc. Natl Acad. ScL USA 84: 4591-4597) and & (DRB5*0101) (Lee et aL 
10 (1984) Nature 299: 750-752) chain genes were obtained from the GenBank database. 
Primers were designed for PGR amplification of either the mature gene product or a 
truncated (ATM) form of each chain. The "top strand" primer of each chain included a 
portion of the phi- 10 gene corresponding to the translational coupler used in the E. coli 
expression system. 
15 DR2- Alpha chain: 
"Top strand" 

5 \ . CGGG ATCCG ATCGTGG AGG ATG ATTA AATG ATC A AAG AAGAACATGTG ATC 
ATC..3' (SEQ. ID. No. 5) 
Full-length "bottom strand" 
20 5' . , .GTCGAATTCTTACAGAGGCCCCCTGCGTT,. .3*(SEQ. ID. No. 6) 
Truncated (ATM) "bottom strand" 

5\. . ATCGAATTCAGITCTCTGTAGTCrCTGGGAG. . .3' (SEQ. ID. NO. 7) 
Dr2-Beta Chain; 
"Top strand" 

25 5 9 . . .CGGG ATCCGATCGTGG AGG ATG ATTA AATGGGGGACACCCGACC ACGTT. . . 
3* (SEQ. ID. No. 8) 
Full-length "bottom strand" 

5 \ . . GTCG A ATTCTC AGCTC ACG AGTCCTGTTGG . . . 3 ' (SEQ. ID. No. 9) 
Truncated (ATM) "bottom strand" 
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.ATCGAATTCACTTGCTCTGTGCAGATTCAGA. . .3' (SEQ. ID. No. 10) 

Following amplification, the PCR products were digested with BamUL and 
EcoRI and subcloned into the plasmid pUC9 which had been similarly treated. 
Recombinant clones were identified and sequenced. The plasmids containing 
5 recombinant or- and /3-chain genes were designated p26416 and p26417 respectively. 
Sequence analysis of the a chain revealed a point mutation (G to T) at base 649 of the 
published sequence. This mutation results in a valine to leucine substitution at residue 
217 of the full length mature gene product. No deviations from the published sequence 
were observed for the jS-chain. 

10 Full-length expression constructs . Plasmids p26416 and p26417 were 

treated with BamBI and EcoRl. Fragments corresponding to the a- and 0-chains were 
subcloned into the expression vector p27313. Recpmbinant clones were identified by 
restriction analysis and given the designations p27317 (a-chain) and p27316 (0-chain). 

ATM expression constructs . PCR amplification of truncated or- and 

15 chain genes was performed using the PCR primer pairs previously described. The 
plasmids p26416 and p26417 were used as target DNA for the or- and /8-chain, 
respectively. The PCR products were treated with BamHl and EcoRI and ligated into 
plasmid p27316 which had been treated with the same enzymes to remove the full-length 
/3-chain. The resulting plasmids were designated p26495 (a-ATM) and p26496 (/S~ 

20 ATM). 

EXAMPLE 2 
Expression of MHC Class II Molecules in E. coli 
Construction of Host Strain W31 10/DE3 . E. coli strain W3110 was made 
lysogenic for the phage lambda-DE3 (which carries a copy of the T7 RNA polymerase 
25 gene) using the DE3 lysogenization kit from Novagen, following the manufacturer's 
instructions. 

Induction of Recombinant Clones . Plasmids p27316, p27317, p26495 and 
p26496 were transformed into the host strain W3110/DE3. 

Cultures were grown at 37°C in LB containing 0.4% glucose and 100 
30 /xg/ml ampicillin. Cells were induced in mid-log growth by addition of isopropyl-0-b-D- 
thio-galactopyranoside (IPTG) (0.4 mM final concentration) and allowed to grow at 
37°C. Periodic samples were taken and chilled on ice prior to processing. 



WO 95/11702 PCT7US94/12231 

34 

Cells were harvested by centrifugation at 5000 x g at 4°C for 10 minutes. 
Cells were resuspended in TE (10 mM Tris-Cl, 1 mM EDTA pH 8,0) in a volume 
appropriate to yield 0.02 OD^ao/pl. Reduced samples were prepared by adding equal 
volumes of cell suspension and 2X SDS-sample buffer containing 0.3M 2- 
mercaptoethanol and boiling for 5 minutes. Ten microliters of sample were applied to a 
12% SDS~gel and following electrophoresis proteins were visualized with Coomasie 
Brilliant Blue staining. 

For solubility testing, 200 /xl of cell suspension was sonicated with a 
micro- tip at a setting of 4 for three 10-second bursts on ice. Insoluble material was 
separated from soluble material by centrifugation for 10 minutes at 12,000 x g at 4°C. 
The insoluble material (inclusion body) was washed once with 500 fxl of cold TE and re- 
spun. The pellet was then aspirated and dissolved in 400 pi of IX SDS-sample buffer 
containing 0.3M 2-mercaptoethanol and boiled. 

Purification of rPR2 a and B chains . Recombinant DR2 a and 0 chains 
with and without the transmembrane region were purified by preparative electroelution as 
generally described in Passmore et alJ. Immunol Meth. 155:193-200 (1992). Inclusion 
body preparations in 8M urea and 10 mM DTT at a concentration of 6 mg/ml were 
dialyzed for 16 hours against sample buffer: 25 mM Tris-HCl, pH 6*8 and 0.25 % 
SDS. Two mg of starting sample was loaded onto a Bio-Rad Prep Cell device containing 
13.5% resolving and 4% stacking gel. Electrophoresis was performed at 40 mA constant 
current for the duration of elution. Fractions (3ml) were collected starting at 360 
minutes with a flow rate of 1 ml/min. Eluted fractions were analyzed for by 13.5% 
non-reducing SDS-PAGE followed by silver stain. Based on polyacrylamide gel 
analysis, a and monomers were pooled and concentrated with an Amicon Centriprep 
10k molecular weight cut-off filtration system. The final monomer preparations were 
dialyzed against PBS containing 0.01% Tween-80 and 0.02% azide. The yield was 
calculated by Lowry assay, with a typical recovery ranging between 15-30% of the 
loaded protein. MHC polypeptides purified by this method were free of any detectible 
contaminants. 

Results . Clones of the mature full length a- and /3-chains of DR2-Dw2 as 
well as truncated forms of each lacking the putative transmembrane and cytoplasmic tail 
regions have been constructed and inserted into a T7 expression vector for expression in 
E. colt. 
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The a-chain clone used herein contains a nucleotide substitution at base 
649, compared to the published sequence. This difference results in the substitution of a 
leucine residue in place of a valine residue at amino acid 217 of the full length product. 
This residue is within the transmembrane portion of the molecule and is therefore not 
5 present in the ATM construct. Because of the conservative nature of the leucine for 
valine substitution, and its positioning within the transmembrane region, the mutation 
was not considered a significant hinder to further experiments with regard to peptide 
binding and interaction with T-cells. 

Induction of both full-length and truncated constructs in W3110/DE3 
10 resulted in the substantial accumulation of protein at or near the expected size as 

evaluated by SDS-PAGE. Below is a summary of the lengths and expected molecular 
weights for each of the four proteins evaluated: 
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_ Protein Residues Expected M r 

Alpha FuIHength 229 25971 

Beta Full-length 237 27035 

Alpha ATM 192 22174 

5 Beta ATM 198 23024 

Following sonication of the cells and separation of the soluble and 
insoluble fractions by centrifugation, the target gene products were located in the soluble 
(inclusion body) fraction in all constructs. 

NH 2 -terminal amino acid sequencing of the first five residues of the DR2- 
10 Dw2 alpha chain full length and ATM products matched exactly the predicted amino acid 
sequence for the natural product except that a methionine residue was found in the first 
position, presumably due to its incomplete removal in vivo by E. coli. NH 2 -terminal 
sequence analysis of the first five residues of the two /?-chain gene products matched 
exactly the expected mature sequence for this chain with no methionine present in the 
15 first position, 

The E. coli-expressed a- and ^-chains bind peptide as well as isolated 
chains of the naturally derived material. Thus, the denatured single chains are able to 
achieve some level of conformation (without complex refolding procedures) that enables 
them to bind peptide. 

20 While complex refolding steps do not appear necessary, improvements in 

binding efficiency might be reaJ&ced following controlled renaturation of the single chain 
moieties. 

EXAMPLE 3 

Binding of Antigenic Peptide to Recombinant M HC Polypeptide 
25 Purified chains at a concentration of 200 /*g/ml and DR2 dimers at a 

concentration of 400 /ig/m! were incubated with radiolabeled MBP(83-102)Y 83 peptide or 
MBP(1-14) peptide at 37°C for 96 hours. Three /ig of the samples were analyzed on 
13.5% polyacrylamide SDS-PAGE under non-reducing conditions. Gels were stained 
with silver stain, autoradiographed arid radioactivity associated with each chain was 
30 counted. The percent of chains occupied with labeled peptide was calculated from the 
specific activity of the respective peptides. 

Results presented in Figure 1 show that rDR20(-TM) bound maximum 
peptide followed by rDR2a(+T t M) and rDR2/S(+TM). The rDR2or(-TM) showed no 
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significant binding of MBP(83-102)Y 83 peptide. In addition, recombinant chains showed 
increased binding as compared to equimolar amount of DR2 native heterodimer. The 
results were reproducible in 4 different experiments. The specificity of the peptide 
binding was demonstrated by incubating the chains with an equivalent amount of another 
5 epitope from the same myelin basic protein, MBP(1-14). In all cases, the binding of 
MBP(1-14) was insignificant. 

Association and Dissociation kinetics of rDR2 chains with radiolabeled 
peptides . The on rate kinetics of binding was measured similarly as described above. 
Chains at a concentration of 200 jug/ml were incubated at 37°C with labeled peptide. At 
10 various times, 15 fxl of sample was removed, chilled to 4°C and analyzed on 13.5% 

SDS-PAGE. The percent peptide occupancy was calculated from the specific activity as 
described above. The stability of chain-peptide complexes were compared at zero and at 
37°C (Figure 2). Single chain-peptide complexes appeared to be as stable as the 
heterodimeric native DR2-peptide complexes. 

15 

EXAMPLE 4 

P gptidg ginijing As$3y withJBiQt m yljited Antigenic Peptide 
Further binding studies were carried out using the four recombinant 
chains, rDR2a(4-TM), rDR2«(-TM), rDR20(+TM), rDR2/8(-TM), purified by 

20 conventional preparative chromatographic procedures as described above. Biotinylated- 
MBP (83-102) Y 83 , biotinylated-MBP (124-143) and biotinylated-MBP (1-14) peptides 
were used for the binding assay. Recombinant chains at a concentration of 0*2 mg/ml 
was incubated with 50 fold molar excess of biotinylated-MBP peptides. For the 
quantitation of the percentage of chain occupied with the biotinylated-peptide, resulting 

25 complexes were analyzed in a plate assay using enzyme conjugated avidin system. One 
mg per 50 ml affinity purified L243 monoclonal antibody, polyclonal anti-alpha and 
polyclonal anti-beta were coated for the DR2, a and j8 chains (with and without 
transmembrane regions), respectively on a 96 well microtiter plate. The polyclonal anti- 
cs and anti-jS antibodies were purified from immunized rabbit sera on a antigen-coupled 

30 sepharose-4B column. Calibration of the assay was achieved by coating know amounts of 
biotinylated-BSA. After capture of complexes, unbound peptide was removed by 
washing, followed by incubation with avidin-alkaline phosphatase. Unbound enzyme 
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conjugate was removed by washing, and a colorimetric substrate (Sigma 104) was added 
to the the detection of the enzymatic product by measuring absorbance of 405 nm. 

Native HLA-DR2 has been shown to have high affinity for peptides MBP 
(83-102)Y" and MBP (124-143) and no affinity for MBP (1-14). The results presented 
5 in Figure 3 show that purified recombinant polypeptide chains, like native heterodimers 
were capable of binding both MBP(83-102)Y* 3 and MBP(124-143) peptides. The MBP 
(1-14) did not show any significant binding to any purified chain preparations. The 
optimum pH for maximum binding was different than native DR2 in all cases. 

10 EXAMPLE 5 

T Cell Receptor Occupancy Assay 
The herpesvirus saimiri (HVS) transformed SS8T human T cell clone 
(provided by H. Wekerle, Max Planck Institute for Psychiatry , Munich, Germany, see y 
Weber et al Proc. Nail Acad. ScL USA 90:11049-11053 (1993)) which recognizes 

15 MBP(83-102)Y n in the context of DRB5 * 0101 was cultured in RPMI 1640 medium 

supplemented with 2 mM L-glutamine, 100 units/ml penicillin, 100 jig/ml streptomycin, 
10% fetal bovine serum (Hyclone) and 50 units/ml human IL-2 (ABI) at 37°C. Every 
alternate day the cells were transferred to fresh media. Based on the binding results 
shown in example 4, complexes of the four purified recombinant chains with MBP (83- 

20 102) Y 83 and MBP (124*143) peptides were prepared for the in vitro functional assay. 
Various complex preparations were added at a final concentration of 10% v/v in a 
microtiter tissue culture plate and the cells were added at a density of 20,000/well in 200 
fd media without EL-2 . After 48 hours incubation at 37°C, the supernatants were 
collected from each well to test for the increase in gamma-IFN leveL For the detection 

25 of gamma-IFN, Nunc Max i sorb 96 well plates were coated with anti-human gamma-IFN 
monoclonal antibody at a concentration of 0.5 /xg/well and incubated at 4°C overnight. 
The wells were blocked with 0.1% BSA, and samples were incubated at room 
temperature for 2 hours. The standard curve was generated by using human gamma-IFN 
with a dilution range of 1000 , 500, 100, 50, 10, 5, 1, 0.5, 0.1 units/ml ( 270 units/ml 

30 ~ 10.75 hg/mi ). Rabbit anti-human gamma-IFN was then added at a concentration of 
l^g/ml and plates were incubated at room temperature for additional 2 hours. Wells 
were extensively washed and incubated with HRP-conjugated goat anti-rabbit at a 
concentration of 800 ng/ml for 1 hour at 37°C, prior to the color developed using TMB 
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as a substrate. The reaction was stopped by 2 N sulfuric acid at 5 min. and the 
absorbance was measured at 450 nm. 

The increase in gamma IFN production of T cells has been shown to occur 
following TCR occupancy by specific ligands. The complexes of native DR2 with MBP 
5 (83- 102) Y" peptide were used as a positive control in this assay. The specificity of the 
increase in gamma IFN production was demonstrated by complexes of native DR2 or 
chains with irrelevant high affinity MBP (124-143) peptide in all experiments. Similarly, 
complexes of DR3 with MBP (83- 102) Y 83 peptide was used to demonstrate the restriction 
of SS8T cloned T cells by HLA-DR2. Results obtained with complexes of alpha chain 
10 (with or without Tm) and beta chain (with or without Tm) are presented in Figure 4 and 
Figure 5. 

These results clearly demonstrate that MHC class II single chain-peptide 
complexes function like complexes of antigenic peptide and native heterodimer. 

15 EXAMPLE 6 

Treatment of EAE using recombinants produced I-A* a chain 
This example demonstrates the ability of the recombinantly produced 
single chain complexes of the invention to induce anergy in vivo. These experiments 
demonstrate prevention of EAE in SJL/J mice. The a chains of IA S were recombinantly 
20 expressed using the methods of the invention. Briefly, PCR primers were prepared 

based on the gene sequence available in Genbank to isolate the gene from mouse spleen 
cells. The resultant gene was expressed using expression vector p27313, as described 
above. 

EAE was induced by adoptive transfer of 1 x 10 7 MBP(91-103) reactive T 
25 cells as described in Sharma etal. Proc. Natl Acad. ScL USA 88:11465-11469 (1991), 
The experiment was performed using a chain of IA* complexed with MBP 91-103 or 1- 
14 prepared as described above. On days 0, 2, 4, and 6, each mouse received 40jtg of 
complexes in as described in Sharma et al< The results are shown in Figure 6. As can 
be seen there, animals receiving PBS alone or irrelevant complex (I- A* complexed with 
30 MBP(1*14)) showed paralysis, whereas animals receiving relevant complex (I-A § 
complexed with MBP(9 1-103)) did not. 
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EXAMPLE 7 

Ubiquitin Fusion Expression System for Expression of MHC 

in E. cofi 

Use of yeast genes encoding ubiquitin in the production of fusion proteins 
in E. coli are described in Miller et al Biotechnology 7:698-704 (1989). In this system, 
the gene of interest is produced as a CMerminal fusion to the first 76 residues of the 
yeast ubiquitin gene contianing the UPP cleavage site. Cleavage at the junction of the 
two moieties results in production of a protein having an intact authentic N-terminal 
reside. 

Examples of such cleavage sites are the ubiquitin sequences recognized by 
the yeast ubiquitin protein peptidase (UPP) and ubiquitin specific protease (UBPI), UPP 
is the product of the YUH1 gene and can be expressed in active form in E. coli and used 
in vitro or in vivo to cleave ubiquitin-protein fusions at the fusion junction (Miller et al, 
supra) The efficiency of cleavage by UPP has been shown to be affected by the length of 
the fusion product and cleavage of fusions greater than 20Kd are often inefficient* 
However, UBP1 has been shown by Tobias and Varshavsky J. Biol Chem. 266:12021 
(1991) to have the same proteolytic properties of UPP but not affected by the size of the 
fusion. 

Taking into account that the predicted size of a ubiquitin-DR2 alpha fusion 
is M r 30738, and the finding that cleavage by UPP is inefficient with proteins of this 
size, UBP1 would be prefered for this system. 

Two approaches can be taken to make use of the ubiquitin system for 
production of -authentic " MHC II chains in E. coli. First, either UPP or UBP1 could be 
cloned and expressed in £1 coli and cell extracts from induced cells used in vitro to 
cleave the fusion protein (which would be made separately). Secondly, either UPP or 
UBP1 could be supplied in vivo for simultaneous expression with the MHC n chain 
resulting in the production of "authentic* MHC II directly from the crude cell extract. 

Construction of ubiquitin fusion vector p27340 . The sequence encoding the 76 amino- 
acid coding region of ubiquitin was amplified from yeast genomic DNA by PCR. The 
primers were designed based on the Ubi76 sequence in Genbank. The primers were as 
follows: 
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Ubi "top" primer 

BamHI 

5 \ . TCAGGATCCGATCGTGGAGGATGATTAA ATGCAAATTTTTGTCAAGACTTT 
GACTGGT ...3* (SEQ. ID. No, 16) 

The primer sequence includes the phi- 10 coupler region and the underlined 

sequence represents the actual ubiquitin 5* sequence. 

Ubi "reverse" primer 

EcoRI SacII 



5\.. TGAATTCCCGCGGAGTCTCAAGACTA AGTGC A AAGTGGA. . 3 ' (SEQ. ID. 
No. 17) 

15 The unique SacII site is generated by altering the base sequence so as to 

create the restriction site for fusion cloning but maintaining the amino acid sequence of 
the molecule. 

An expression vector for production of desired fusion products, p27340, 
was generated by digesting p27313, described above, with BamHI and EcoRI and 
20 ligating the 5500 bp fragment with the 259 bp ubi-76 PCR product digested with the 
same enzymes. 

p27340 which can be used to fuse any gene to the ubiquitin gene so as to 

make a fusion product which can then be specifically cleaved to yield a protein of desired 

N-terminus. The vector is selectable with ampicillin. 
25 The nucleic acid encoding the DR alpha chain lacking its transmembrane 

region was cloned in p27340 and was expressed as a fusion product of expected 

molecular weight of 30,758 daltons. SDS-PAGE gels showed the presence of a double 

band similiar to the expression products from plasmid p26495. 

Construction of p27351 and p27373 . The DR alpha chain sequences were 
30 amplified using PCR to generate the fxill length sequence as well as the sequence lacking 

the transmembrane and cytoplasmic domains. The PCR primers designed for the fusion 

of these sequences to the ubiquitin 76 sequence were as follows : 



35 
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Alpha-ubi76 primer( top strand) 
SacII 



5'... ATCCGCGGGGGC ATCAAAGAAGAACATGTGATCATC ...3' (SEQ. ID. No. 18) 

The sequence comprising the SacII site regenerates the ubiquitin cleavage 
site when fused to the ubi-76 gene. The underlined sequence represents the DR alpha 
chain without its initial methionine codon. 

Alpha F/L pnmer(bottom strand) 
EcoRI 

5\.. GTCXj AATTCTTACAGAGGCCCCCTGCGTT. . .3' (SEQ. ID. No. 19) 

Alpha ATM pnmcr(bottom strand) 
EcoRI 



S\„ GTCG AATTCTCAGTTCTCTGTAGTCTCTGGG AG. . .3' (SEQ. ID. No. 20) 
Alpha ATM- 10 primer (bottom strand) 

5^..ATCGAATTCTTAAGCATCAAAC^CCCAGTGCTT...3 , (SEQ. ID. No. 21) 

p2^351 was generated by digesting p27340 with SacII and EcoRI and 
ligating the 5740 bp fragment with the 589 bp fragment of the DR ATM alpha PCR 
product digested with the same enzymes. The plasmid p27373 expresses DR alpha 
lacking the transmembrane domain and ten more residues from the extracellular domain 
fused to Ubiquitin 76 sequence. 



Construction of p32941 . The DR alpha full-length sequence was obtained 
by PCR amplification of p27317. The PCR primers were designed so as to contain the 
restriction enzyme sites SacII and EcoRI and these were used to subclone the PCR- 
amplified fragment into p27340. 
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The plasmid was transformed into the expression host W3110/DE3. 
Following growth and IPTG induction, expression of the fusion protein of Ubiquitin 76 
+ DR alpha F/L of the expected molecular weight M r — 34,625 was observed. 



5 EXAMPLE 8 

Additional E. coli MHC II Expression Plasmids 
Construction of p28524 for expression of PR alpha ATM chain that is 
shorter by 10 residues . The plasmid p26495 expresses DR alphaATM chain in the pET 
expression system* Upon IPTG inductions, the SDS-PAGE gel shows a doublet of bands 
10 at the expected molecular weight. These bands were sequenced for the first 5 N-terminal 
residues and both gave the correct sequence for the alpha chain. Plasmid p28524 was 
constructed to generate a even more truncated version of DR alphaATM chain. The 
sequence was PGR amplified using the following primers: 
Top strand primer: 

15 

5 \ . . CGGG ATCCGATCGTGGAGG ATGATTA AATGATC AAAGA AGAACATGTGATC 
AT.. .3' (SEQ. ID. No. 22) 

Bottom strand primer: 

5 \ . . ATCG AA1TCTTAAGC ATC A A ACTCCC AGTGCTT. . . 3 * (SEQ. ID. No. 23) 

20 

The PCR product was digested with BamHl and EcoRl and cloned into p27313 digested 
with the same enzymes. 

Construction of E.coli expression plasmid with tetracycline resistance. The following 
plasmids were constructed for expression of MHC class n single chains in E.colL 

25 Plasmids p26495, p26496, p27316 and p27317 described above express ATM and full- 
length DR alpha and beta chains in the presence of ampicillin resistance. For scale-up 
culturing of E.coli strains, ampicillin is not an effective antibiotic as it is degraded 
rapidly due to the ^-lactamase secreted by the cells containing amp resistant plasmids. 
Therefore tetracycline resistance gene was cloned into the above plasmids to make them 

30 more stable under fermentation conditions. 

Construction of p27329 and p27330 . The tetracycline resistance gene was 
amplified by PCR using pBR322 as target DNA and the following PCR primers: 
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Top strand primer: 

5\ . .A TCTCGAG TTTGACAGCTTATCATCG. . .3' (SEQ. ID. No. 24) 
Ava 1 

Bottom strand primer: 

5 5 \ • • ATCXCGAGTCAGGTCGAGGTGGC. . .3' (SEQ. ID. No. 25) 
Ava 1 

p27329 was generated by linearizing p27316 with Aval and phosphatasing 
the 6245 bp fragment with calf intestinal phasphatase. This fragment was ligated to the 
Tetracycline PCR product digested with Aval. The resulting plasmid expresses 
10 DRB5*0101 full-length chain in the presence of both ampicillin and tetracycline markers. 

p27330 was generated by similar manipulations starting with Aval 
digested p273l7 and cloning in the Tetracycline PCR product. The resulting plasmid 
expresses DR alpha full-length chain in the presence of both ampicillin and tetracycline 
markers. 

15 Construction of p329129 and p33435 . The tetracycline gene was cloned 

into p28524 and p26496 by restriction digestion. Plasmids p26495 or p26496 were 
digested with Xbal and Aval. The resulting 3736 bp fragment was ligated to the 3507 
bp fragment generated by digestion of p27329 with Xbal-f Aval+Pstl. The resulting 
plasmids express DR alpha and beta ATM chains in the presence of ampicillin and 

20 tetracycline markers. 

EXAMPLE 9 

Fgrmentatipn pf E,<?qH $frafo$ Expressing MHC Class II 
Sfrglg Chain? 

A ten-liter microferm system was purchased from New Brunswick 
25 Scientific. The system included features which allow for monitoring and control over 
pH, dO a , temperature, and agitation rate. In addition, the system was supplied with a 
pump for use in continuous feed of media to the fermentation broth. 

All the MHC class II single chains expressed in the E. coli host 
W3110/DE3 were expressed under typical fermentation conditions to obtain high yields 
30 of proteins. Induction was carried out at an OD^of approximately 20. The cells were 
harvested 2 hours following induction and processed for inclusion body preparation. A 
typical fermentation run generates wet cell pastes on the scale of 500 g which can be 
frozen at -20 *C and processed for lysis in batches. 
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The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary 
skill in the art and are encompassed by the appended claims. All publications, patents, 
and patent applications cited herein are hereby incorporated by reference. 



WO 95/11702 



PCT/US94/12231 



46 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Anergin, Inc 

(B) STREET: 301 Penobscot Drive 

(C) CITY : Redwood City 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) POSTAL CODE (ZIP): 94063 

(G) TELEPHONE: (415) 361-8901 

(H) TELEFAX: (415) 361-8958 

(I) TELEX: 

(ii) TITLE OF INVENTION: PROKARYOTIC EXPRESSION OF MHC PROTEINS 
(iii) NUMBER OF SEQUENCES: 25 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1,0, Version #1.25 

<V) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO not yet assigned 

(B) FILING DATE: 25-OCT-1994 

(C) CLASSIFICATION: 

(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/143,575 

(B) FILING DATE: 25-OCT-1993 

(vii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Bastian, Kevin L. 

(B) REGISTRATION NUMBER: 34,774 

(C) REFERENCE /DOCKET NUMBER: 1405 8-2 1-PC 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 543-9600 

(B) TELEFAX: (415) 543-5043 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TATGGCTAGC ATGACTGGTG GACAGCAAAT GGGTGCCCTA GGCTTCGAAT CTAA 
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(2) INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ACCGATCGTA CTGACCACCT GTCGTTTACC CAGCCCTAGG CTTCGAATCT AA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
AATTCCTACG TA 

(2) INFORMATION FOR SEQ ID NO:4s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGATGCATCT AG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGGGATCCGA TCGTGGAGGA TGATTAAATG ATCAAAGAAG AACATGTGAT CATC 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GTCGAATTCT TACAGAGGCC CCCTGCGTT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATCGAATTCA GTTCTCTGTA GTCTCTGGGA G 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGGGATCCGA TCGTGGAGGA TGATTAAATG GGGGACACCC GACCACGTT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTCGAATTCT CAGCTCACGA GTCCTGTTGG 
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(2) INFORMATION FOR SEQ ID NO; 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 

ATCGAATTCA CTTGCTCTGT GCAGATTCAG A 31 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 
<B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix ) FEATURE : 

(A) NAKE/KTYx CDS 

(B) LOCATION: 1..690 

( ix ) FEATURE : 

(A) NAME /KEY i misc_f eature 

(B) LOCATION: 577, .690 

<D) OTHER INFORMATION: /note- "Encodes the transmembrane 
region within HLA DR2~Dw2 Alpha Chain. ** 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 11: 

ATC AAA GAA GAA CAT CTC ATC ATC CAG GCC GAG TTC TAT CTG AAT CCT 48 
lie Lys Glu Glu Him Val lie lie Gin Ala Glu Phe Tyr Leu Asn Pro 
1 5 10 15 

GAC CAA TCA GGC CAG TTT ATG TTT GAC TTT GAT GGT GAT GAG ATT TTC 96 
Asp Gin Ser Gly Glu Phe Met Phe Asp Phe Asp Gly Asp Glu lie Phe 
20 25 30 

CAT GTG GAT ATG CCA AAG AAG GAG ACG GTC TGG CGG CTT GAA GAA TTT 144 
His Val Asp Met Ala Lys Lys Glu Thr Val Trp Arg Leu Glu Glu Phe 
35 40 45 

GGA CGA TTT GCC AGC TTT GAG GCT CAA GGT GCA TTG GCC AAC ATA GCT 192 
Gly Arg Phe Ala Ser Phe Glu Ala Gin Gly Ala Leu Ala Asn He Ala 
50 55 60 

GTG GAC AAA GCC AAC CTG GAA ATC ATG ACA AAG CGC TCC AAC TAT ACT 240 
Val Asp Lys Ala Asn Leu Glu He Met Thr Lys Arg Ser Asn Tyr Thr 
65 70 75 80 

CCG ATC ACC AAT GTA CCT CCA GAG GTA ACT GTG CTC ACG AAC AGC CCT 288 
Pro He Thr Asn Val Pro Pro Glu Val Thr Val Leu Thr Asn Ser Pro 
85 90 95 

GTG GAA CTG AGA GAG CCC AAC GTC CTC ATC TGT TTC ATC GAC AAG TTC 336 
Val Glu Leu Arg Glu Pro Asn Val Leu He Cys Phe He Asp LyB Phe 
100 105 110 
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ACC CCA CCA GTG GTC AAT GTC ACG TGG CTT CGA AAT GGA AAA CCT GTC 384 
Thr Pro Pro Val Val Asn Val Thr Trp Leu Arg Asn Gly Lys Pro val 
115 120 125 

ACC ACA GGA GTG TCA GAG ACA GTC TTC CTG CCC AGG GAA GAC CAC CTT 432 
Thr Thr Gly Val Ser Glu Thr Val Phe Leu Pro Arg Glu Asp His Leu 
130 135 140 

TTC CGC AAG TTC CAC TAT CTC CCC TTC CTG CCC TCA ACT GAG GAC GTT 480 
Phe Arg Lys Phe His Tyr Leu Pro Phe Leu Pro Ser Thr Glu Asp Val 
145 150 155 160 

TAC GAC TGC AGG GTG GAG CAC TGG GGC TTG GAT GAG CCT CTT CTC AAG 528 
Tyr Asp Cys Arg Val Glu His Trp Gly Leu Asp Glu Pro Leu Leu Lys 
165 170 175 

CAC TGG GAG TTT GAT GCT CCA AGC CCT CTC CCA GAG ACT ACA GAG AAC 576 
His Trp Glu Phe Asp Ala Pro Ser Pro Leu Pro Glu Thr Thr Glu Asn 
180 185 190 

GTG GTG TGT GCC CTG GGC CTG ACT GTG GGT CTG GTG GGC ATC ATT ATT 624 
Val Val Cys Ala Leu Gly Leu Thr Val Gly Leu Val Gly lie He He 
195 200 205 

GGG ACC ATC TTC ATC ATC AAG GGA GTG CGC AAA AGC AAT GCA GCA GAA 672 
Gly Thr He Phe He He Lys Gly Val Arg Lys Ser Asn Ala Ala Glu 
210 215 220 

CGC AGG GGG CCT CTG TAA 690 
Arg Arg Gly Pro Leu 
225 230 

(2) INFORMATION FOR SEQ ID NO: 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

( B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEs protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO* 12: 

He Lys Glu Glu His Val He He Gin Ala Glu Phe Tyr Leu Asn Pro 
1 5 10 15 

Asp Gin Ser Gly Glu Phe Met Phe Asp Phe Asp Gly Aop Glu He Phe 
20 25 30 

His Val Asp Met Ala Lys Lys Glu Thr Val Trp Arg Leu Glu Glu Phe 
35 40 45 

Gly Arg Phe Ala Ser Phe Glu Ala Gin Gly Ala Leu Ala Asn He Ala 
50 55 60 

Val Asp Lys Ala Asn Leu Glu He Met Thr Lys Arg Ser Asn Tyr Thr 
65 70 75 " 80 

Pro lie Thr Asn Val Pro Pro Glu Val Thr Val Leu Thr Asn Ser Pro 
85 90 95 

Val Glu Leu Arg Glu Pro Asn Val Leu He Cys Phe He Asp Lys Phe 
100 105 110 

Thr Pro Pro Val Val Asn Val Thr Trp Leu Arg Asn Gly Lys Pro Val 
115 120 125 



If II KJ x'«Ji JUL i VA 



51 

Thr Thr Gly Val Ser Glu Thr Val Phe Leu Pro Arg Glu Asp Hi© Leu 
130 135 140 

Phe Arg Lys Phe His Tyr Leu Pro Phe Leu Pro Ser Thr Glu Asp Val 
145 150 155 160 

Tyr Asp Cys Arg Val Glu His Trp Gly Leu Asp Glu Pro Leu Leu Lys 
165 170 175 

His Trp Glu Phe Asp Ala Pro Ser Pro Leu Pro Glu Thr Thr Glu Asn 
180 185 190 

Val Val cys Ala Leu Gly Leu Thr Val Gly Leu Val Gly lie lie lie 
195 200 205 

Gly Thr He Phe He He Lys Gly Val Arg Lys Ser Asn Ala Ala Glu 
210 215 220 

Arg Arg Gly Pro Leu 
225 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..714 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 595 714 

(D) OTHER INFORMATION: /note= "Encodes the trans-membrane 
region within HLA DR2-Dw2 Beta Chain. •* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 : 

GGG GAC ACC CGA CCA CGT TTC TTG CAG CAG GAT AAG TAT GAG TGT CAT 48 
Gly Asp Thr Arg Pro Arg Phe Leu Gin Gin Asp Lys Tyr Glu Cys His 
15 10 15 

TTC TTC AAC GGG ACG GAG CGG GTG CGG TTC CTG CAC AGA GAC ATC TAT 96 
Phe Phe Asn Gly Thr Glu Arg Val Arg Phe Leu His Arg Asp He Tyr 
20 25 30 

AAC CAA GAG GAG GAC TTG CGC TTC GAC AGC GAC GTG GGG GAG TAC CGG 144 
Asn Gin Glu Glu Asp Leu Arg Phe Asp Ser Asp Val Gly Glu Tyr Arg 
35 " 40 45 

GCG GTG ACG GAG CTG GGG CGG CCT GAC GCT GAG TAC TGG AAC AGC CAG 192 
Ala Val Thr Glu Leu Gly Arg Pro Asp Ala Glu Tyr Trp Asn Ser Gin 
50 55 60 

AAG GAC TTC CTG GAA GAC AGG CGC GCC GCG GTG GAC ACC TAC TGC AGA 240 
Lys Asp Phe Leu Glu Asp Arg Arg Ala Ala Val Asp Thr Tyr Cys Arg 
65 70 75 80 
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CAC AAC TAG GGG GTT GGT GAG AGC TTC ACA GTG CAG CGG CGA <5TT GAG 288 

His Asn Tyr Gly Val Gly Glu Ser Phe Thr Val Gin Ar§ Arg Val Glu 
85 90 95 

OCT AAG GTG ACT C5TG TAT CCT GCA AGG ACC C&G ACC dTG bA£ CAC CAd 336 
Pro Lye Val Tar Val Tyr Pro Ala Ar§ Thr Gin Thr Leu Gin Hie His 

ioo m .. w * UP, ■ 

AAC CTC CTG GTC ffcfe TCT GTG AGT GGT i:t6 TAT CCA GCC AGC ATT GAA 384 
Asn Leu Leu Val Cys Ser Val Ser Gly Pl>e Tyr Pro Ala Ser lie $lu 
115 12 b 125 

GTC AGG TGG TTC CGG AAC AGC? CAG <*AA GAG AAQ GQT G£G GTG GTG TC£ 432 
Val Arg Trp Phe Arg Ash £er Glh Glu Glu Lys Ala Gly Val Val Ser 
130 IS 5 140 

ACA GGC CTG ATT CAG AAT GGA GAC TGG ACC TTC CAG ACC CTG GTG ATG 480 
Thr Gly Leu lie Gin Asn Gly K%p Trp Thr Phe Gin Thr Leu Val Met 
145 150 155 160 

CTG GAA ACA GTT CCT CGA AGT GGA GAG GTT TAC ACC TGC CAA GTG GAG 528 
Leu Glu Thr Val Pro Arg Ser Gly Glu Val Tyr Thr Cys Gin Val Glu 

165 170 ,, 175 

CAC CCA AGC CTG ACC ACC CCT CTC ACA GTG GAA TGG AG A GCA CAG T<?T 576 
His Pro Ser Val Thr Ser Pro Leu Thr Val Glu Trp Arg Ala Gin Ser 
180 185 1&0 

GAA TCT GCA CAG AGC AAG ATG CTG ACT GGA GTC GGG GGC TTT GTG CTG 624 
Glu Ser Ala Gin Ser Lys Met Leu Ser Gly Val Gly Gly: Phe Vail Leu 
195 200 205 

GGC CTG CTC TTC CTT CGG GCC GGG CTA TTC ATC TAC TTC AAG AAT CAG 672 
Gly Leu Leu Phe Leu Gly Ala Gly Leu Phe lie Tyr Phe Lys Asn Glxi 
210 215 220 _ ^ 

AAA GGG CAC TCT CGA CTT CAC CCA ACA GGA CTC GTG' AGC TGA : 714 
Lys Gly His Ser Gly Leu His Pro Thr Gly Leu Val Ser 
225 230 235 

(2) INFORMATION TOR SEQ ID NO: 141 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

< ii ) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Asp Thr Arg Pro Arg Phe Leu Gin Glh Asp Lys T^r Glu Cys His 
1 5 10 v 15 

Phe Phe Asn Gly Thr- GlttOAr^ Val Arg Phi Le^ His Argr Aif 1 10 Tyr 
20 25 ' 30 

Aen Gin Glu Glu Asp Leu Arg Phe Asp Ser Asp Val Gly Glu Tyr Arg 
35 40 45 

Ala Val Thr Glu Leu Gly Arg Pro Asp Ala Glu Tyr Trp Asn S r Gin 
50 55 60 

Lys Asp Phe Leu Glu Asp *r^ Ar^ Ala Ala VaS -Asp Thr Tft: Cys Arg 
65 70 " 75 SO 

His Asn Tyr Gly Val Gly Glu Ser Phe Thr Val Gin Arg Arg Val Glu 
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85 90 95 

Pro Lys Val Thr Val Tyr Pro Ala Arg Thr Gin Thr Leu Gin His His 
100 105 110 

Asn Leu Leu Val Cys Ser Val Ser Gly Phe Tyr Pro Ala Ser lie Glu 
115 120 125 

Val Arg Xrp Phe Arg Asn Ser Gin Glu Glu Lys Ala Gly Val Val Ser 
130 135 140 

Thr Gly Leu lie Gin Asn Gly Asp Trp Thr Phe Gin Thr Leu Val Met 
145 150 155 160 

Leu Glu Thr Val Pro Arg Ser Gly Glu Val Tyr Thr Cys Gin Val Glu 
165 170 175 

His Pro Ser Val Thr Ser Pro Leu Thr Val Glu Trp Arg Ala Gin Ser 
180 185 190 

Glu Ser Ala Gin Ser Lys Met Leu Ser Gly Val Gly Gly Phe Val Leu 
195 200 205 

Gly Leu Leu Phe Leu Gly Ala Gly Leu Phe lie Tyr Phe Lys Asn Gin 
210 215 220 

Lys Gly His Ser Gly Leu His Pro Thr Gly Leu Val Ser 
225 230 235 



(2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 
CGGAATTCCG 10 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCAGGATCCG ATCGTGGAGG ATGATTAAAT GCAAATTTTT GTCAAGACTT TGACTGGT 58 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TGAATTCCCG CGGAGTCTCA AGACTAAGTG CAAAGTGGA 
(2| INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
<CJ STRANDEDNESS: single 
(D> TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

ATCCGCGCCC GCATCAAAGA AGAACATGTG ATCATC 

<2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTHS 29 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GTCGAATTCT TACAGAGGCC CCCTGCGTT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GTCGAATTCT CAGTTCTCTG TAGTCTCTGG GAG 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATCGAATTCT TAAGCATCAA ACTCCCAGTG CTT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 53 base pairs 

(B) TYPE: nucleic acia 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGGGATCCGA TCGTGGAGGA TGATTAAATG ATCAAAGAAG AACATGTGAT CAT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ATCGAATTCT TAAGCATCAA ACTCCCAGTG CTT 
(2) INFORMATION FOR SEQ ID NO; 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATCTCGAGTT TGACAGCTTA TCATCG 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCTCGAGTC AGGTCGAGGT GGC 
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WHAT IS CLAIMED IS : 

1. A composition comprising a recombinant MHC polypeptide which 
binds an antigenic peptide and which has altered glycosylation. 

5 2. A composition of claim 1, wherein the MHC polypeptide lacks a 

transmembrane domain. 

3. A composition of claim 1, wherein the MHC polypeptide is 
expressed by a prokaryotic host cell comprising an expression vector containing a 

10 nucleotide sequence encoding the MHC polypeptide. 

4. A composition of claim 3, wherein the prokaryotic cell is E. coli. 

5. A composition of claim 1, further comprising a second MHC 

15 polypeptide associated with the MHC polypeptide, thereby forming a heterodimeric MHC 
molecule. 

6. A composition of claim 1, wherein the polypeptide is encoded by a 
sequence from an MHC class II gene. 

20 

7. A composition of claim 1, wherein the polypeptide is a 0 subunit of 
an MHC class n molecule. 

8. A method of producing an MHC polypeptide, the method 

25 comprising: 

a) growing in culture a prokaryotic cell containing an expression 
vector comprising a nucleotide sequence encoding the MHC polypeptide under conditions 
such that the polypeptide is expressed; and 

b) isolating the MHC polypeptide. 

30 

9. A method of claim 7, wherein the cell comprises nucleotide 
sequences encoding two MHC polypeptides. 
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10. An isolated MHC polypeptide produced by the method of claim 8. 

11. A prokaryotic expression vector comprising a nucleotide sequence 
encoding an MHC polypeptide operably linked to a prokaryotic promoter sequence. 

12. A vector of claim 11, wherein the nucleotide sequence encoding the 
MHC polypeptide is operably linked to a nucleotide sequence encoding a signal 
sequence. 



13. A vector of claim 11, wherein the nucleotide sequence codes for a 
truncated MHC polypeptide. 

14. A vector of claim 13, wherein the MHC polypeptide lacks a 
transmembrane domain. 

15. A prokaryotic cell comprising the vector of claim 11. 

16. A cell of claim 15, wherein the cell is E. coli. 

17. An MHC-peptide complex consisting essentially of an antigenic 
peptide and an isolated recombinant MHC polypeptide of claim 1. 

18. A complex of claim 17, wherein the antigenic peptide is 
noncovalently associated with the MHC polypeptide. 

19. A complex of claim 17, wherein the MHC polypeptide is soluble. 

20. A complex of claim 17, wherein the antigenic peptide consists of 
between about 8 to about 18 amino acids. 



21. A complex of claim 17, wherein the peptide is an autoantigenic 

peptide associated with an autoimmune disease. 

/ 
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22. A complex of claim 17, wherein an epitope on the peptide is 
recognized by an autoreactive T cell associated with multiple sclerosis, rheumatoid 
arthritis, or myasthenia gravis. 

5 23. A complex of claim 17, wherein the peptide comprises residues 138- 

167 of human AChR a subunit, residues 84-102 of human MBP, or residues 148-162 of 
human MBP. 

24. A complex of claim 17 wherein, the MHC polypeptide is Class IT 

10 MHC 

25. A pharmaceutical composition comprising a pharmaceutically 
acceptable carrier and the MHOpeptide complex of claim 17. 

15 26. A pharmaceutical composition of claim 25, wherein the complex is 

embedded in a liposome. 
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